CRYSTALLOGRAPHIC STRUCTURE OF THE ANDROGEN RECEPTOR
LIGAND BINDING DOMAIN

Field of Invention 

The present invention relates to compositions and crystals of the
androgen receptor ligand binding domain, optionally in complex with its
ligand. This invention also relates to methods of using the structure
coordinates of the androgen receptor ligand binding domain/ligand
complex to solve the structure of similar or homologous proteins or
protein complexes. This invention also relates to methods for designing
and selecting ligands that bind to the androgen receptor and methods of
using such ligands.

Background of the Invention

The androgen receptor (AR) is a member of the steroid nuclear
receptor superfamily of ligand-dependent transcription factors. The
binding of androgen to AR initiates the gene activation required for male
sex development.

AR is an important target primarily in two drug discovery areas.
In oncology drug discovery, inhibitors (antagonists or partial antagonists)
of androgen receptor function are useful for the treatment of
anti-androgen refractory prostate cancer. In metabolic disease drug
discovery, agonists or partial agonists of the androgen receptor in
muscle are useful to treat age-related diseases.

As with the other members of the steroid receptor family, AR has
several functional domains, including a DNA binding domain (DBD) and
a 261-residue ligand-binding domain (LBD) (Mw = 30,245 Da), which
contains the androgen binding site and is responsible for switching on
the androgen function.

Development of synthetic ligands that specifically bind to
androgen receptors has been largely guided by trial-and-error methods of
drug design, despite the importance of the androgen receptor in
physiological processes and medical conditions such as prostate cancer
and modulation of reproductive organs. Previously, new
ligands specific for androgen receptors were discovered in the absence of
information on the three-dimensional structure of the androgen receptor
with a bound ligand. Before the present invention, researchers were
essentially discovering androgen receptor ligands by probing in the dark,
without the ability to visualize how the amino acids of the androgen
receptor held a ligand in its grasp.

Consequently, it would be advantageous to devise methods and
compositions for reducing the time required to discover ligands to the
androgen receptor, synthesize such compounds and administer such
compounds to organisms to modulate physiological processes regulated
by the androgen receptor.

The cDNA and amino acid sequences of human and rat
androgen receptors have been described (Proc. Natl. Acad. Sci. U.S.A.
1988 85: 7211-7215). However, there have been no
crystals reported of any androgen receptor. Thus, x-ray crystallographic
analysis of such proteins has not been possible.

We have discovered the first crystal structure of the androgen
receptor ligand binding domain (AR-LBD). Our understanding of the
androgen receptor structure has allowed for the determination of the
ligand binding site for selective androgen receptor modulators (SARMs).

Summary of the Invention

The present invention provides crystals of AR-LBD and crystals
of an AR-LBD bound to a ligand, i.e., an AR-LBD/AR-LBD ligand
complex. Most preferably the AR-LBD ligand is dihydrotestosterone
(DHT). Thus, the present invention is directed to a crystal of an AR-LBD
comprising:

1) an AR-LBD and an AR-LBD ligand or
2) an AR-LBD without an AR-LBD ligand;

wherein said crystal diffracts to at least 3 angstrom resolution and has a
crystal stability within 5% of its unit cell dimensions. The crystal of AR
or AR-LBD preferably has at least 200 amino acids and preferably
comprises amino acid sequence 672 to 917 of rat AR or the AR amino
acid sequence 672 to 917 of human AR.

The present invention also provides the structure coordinates of
the AR-LBD/AR-LBD ligand complex. The complete coordinates are
listed in Table A.

The present invention also provides a method for determining at
least a portion of the three-dimensional structure of molecules or
molecular complexes which contain at least some structurally similar
features to the androgen receptor ligand binding domain. It is preferred
that these molecules or molecular complexes comprise at least a part of
the ligand binding site defined by structure coordinates of AR-LBD
amino acids V685, L700, L701, S702, S703, L704, N705, E706, L707,
G708, E709, Q711, A735, I737, Q738, Y739, S740, W741, M742, G743,
L744, M745, V746, F747, A748, M749, G750, R752, Y763, F764, A765,
L768, F770, M780, M787, I869, L873, H874, F876, T877 and F878
according to Table A, or a mutant or homologue thereof. Since the
protein sequences for rat and human AR LBD are identical, the human
numbering system was used herein.

The present invention also provides a machine-readable data
storage medium which comprises a data storage material encoded with
machine-readable data defined by the structure coordinates of an
AR-LBD/AR-LBD ligand complex according to Table A or a
homologue of the complex.

The present invention further provides a binding site in AR-LBD
for an AR-LBD ligand as well as methods for designing or selecting AR
modulators including agonists, partial agonists, antagonists, partial
antagonists and/or selective androgen receptor modulators (SARMs) of
AR using information about the crystal structures disclosed herein.

Brief Description of the Drawings

Figure 1 is a ribbon-style drawing of the androgen receptor
LBD. The ligand DHT is shown as a ball-and-stick figure.

Figure 2 is a comparison of the androgen receptor ligand 
binding domain with progesterone receptor ligand binding domain. 

Figure 3 provides three views of the omit electron density map of 
dihydrotestosterone (DHT) in the hormone-binding site of AR-LBD. There 
are hydrogen bonds between the steroid and the side chains of Arg 752 
and Asn 705. 

Figure 4 is a comparison of the binding of dihydrotestosterone to
AR-LBD (top) and of progesterone to PR-LBD (bottom). Note that an
additional hydrogen bond interaction would be possible if both the
side chain of N719 and the progesterone were flipped.

Detailed Description of the Invention

The first crystal structure of the androgen receptor ligand
binding domain (AR-LBD) has been determined to 2.0 Å resolution.
Crystals of rat AR-LBD were grown from precipitating solutions
containing 0.9 M sodium tartrate, 0.1 M Na Hepes, pH 7.5. X-ray
diffraction from the crystals has the symmetry and systematic absences
of the orthorhombic space group P2₁2₁2₁ with unit cell dimensions
a = 56.03 Å, b = 66.27 Å, c = 70.38 Å, and one molecule per asymmetric
unit (Matthews volume = 2.16 Å³ Da⁻¹). The structure was determined by
the method of molecular replacement using the structure of the
Progesterone Receptor LBD (PR-LBD) as the search model.
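
As a consistency check on the figures quoted above, the Matthews volume can be recomputed from the unit cell. The following is a minimal sketch, not part of the original procedure: it assumes four asymmetric units per cell (the general-position multiplicity of P2₁2₁2₁) and uses the 30,245 Da molecular weight quoted in the Background; the function name is illustrative only.

```python
def matthews_volume(a, b, c, mol_weight_da, asu_per_cell, mols_per_asu=1):
    """Matthews volume (cubic angstroms per dalton) for an orthorhombic cell."""
    cell_volume = a * b * c  # orthorhombic cell: all angles are 90 degrees
    return cell_volume / (asu_per_cell * mols_per_asu * mol_weight_da)


# Unit cell and molecular weight as stated in the text; asu_per_cell=4 is
# an assumption based on the space group P2(1)2(1)2(1).
vm = matthews_volume(56.03, 66.27, 70.38, mol_weight_da=30245.0, asu_per_cell=4)
print(f"{vm:.2f}")  # prints 2.16
```

The result agrees with the stated value of 2.16 Å³ Da⁻¹ for one molecule per asymmetric unit.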

The complex of AR-LBD with dihydrotestosterone (DHT) shows
the mode of binding of the steroid to the receptor in the agonist
conformation.

[Chemical structure: Dihydrotestosterone]

The following abbreviations are used throughout the application:



A = Ala = Alanine
V = Val = Valine
L = Leu = Leucine
I = Ile = Isoleucine
P = Pro = Proline
F = Phe = Phenylalanine
W = Trp = Tryptophan
M = Met = Methionine
G = Gly = Glycine
S = Ser = Serine
T = Thr = Threonine
C = Cys = Cysteine
Y = Tyr = Tyrosine
N = Asn = Asparagine
Q = Gln = Glutamine
D = Asp = Aspartic Acid
E = Glu = Glutamic Acid
K = Lys = Lysine
R = Arg = Arginine
H = His = Histidine

"Atom type" refers to the element whose coordinates have been 
determined. Elements are defined by the first letter in the column. 

"X, Y, Z" crystallographically define the atomic position determined for 
each atom. 

"B" is a thermal factor that measures movement of the atom around its 
atomic center. 

"Occ" is an occupancy factor that refers to the fraction of the molecules 
in which each atom occupies the position specified by the coordinates. A 
value of "1" indicates that each atom has the same conformation, i.e., the 
same position, in all molecules of the crystal. 

Additional definitions are set forth in the specification where necessary. 

The androgen receptor (AR) described herein is intended to
include any polypeptide which has the activity of the naturally occurring
androgen receptor. The AR and AR-LBD contemplated herein include
all vertebrate and mammalian forms such as rat, mouse, pig, goat,
horse, guinea pig, rabbit, monkey, orangutan and human. Such terms
also include polypeptides that differ from naturally occurring forms of
AR and AR-LBD by having amino acid deletions, substitutions, and
additions, but which retain the activity of AR and AR-LBD, respectively.
The crystal structure of the invention preferably contains at least 25%,
more preferably at least 50%, more preferably at least 75%, more
preferably at least 90%, more preferably at least 95%, more preferably at
least 99%, and most preferably all of the coordinates listed in Table A.
The crystal of the AR-LBD/AR-LBD ligand of the invention preferably
has the following unit cell dimensions in angstroms: a = 56.03 ± 5%,
b = 66.27 ± 5%, c = 70.38 ± 5% and an orthorhombic space group
P2₁2₁2₁.
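
The ± 5% tolerance on the unit cell can be expressed as a simple check. This is an illustrative sketch only; the function and variable names are assumptions, and it tests the axis lengths alone, not the space group.

```python
# Reference axis lengths in angstroms, as stated in the text.
REFERENCE_CELL = {"a": 56.03, "b": 66.27, "c": 70.38}


def cell_within_tolerance(cell, reference=REFERENCE_CELL, tolerance=0.05):
    """Return True if every axis length is within +/- tolerance of the reference."""
    return all(
        abs(cell[axis] - ref_length) <= tolerance * ref_length
        for axis, ref_length in reference.items()
    )


# Hypothetical measured cells, for illustration only.
print(cell_within_tolerance({"a": 57.00, "b": 66.00, "c": 71.50}))  # True
print(cell_within_tolerance({"a": 60.00, "b": 66.27, "c": 70.38}))  # False
```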

The AR-LBD ligand of this invention is any peptide, peptide
mimetic or nonpeptide, including small organic molecules, that is
capable of acting as a ligand for AR-LBD. In a preferred embodiment,
the AR-LBD ligand is an AR modulator. By "AR modulator" it is meant
an agonist or activator, a partial agonist or partial activator, an
antagonist or inhibitor, or a partial antagonist or partial inhibitor which
demonstrates tissue-specific activation of the AR. Such compounds are
also referred to herein as SARMs (selective androgen receptor
modulators) of the AR-LBD. Examples of preferred agonists include
androgens such as dihydrotestosterone.

The peptides referred to herein (e.g., AR, AR-LBD, and the like)
may be produced by any well-known method, including synthetic
methods, such as solid phase, liquid phase and combination solid
phase/liquid phase syntheses; recombinant DNA methods, including
cDNA cloning, optionally combined with site-directed mutagenesis;
and/or purification of the natural products, optionally combined with
enzymatic cleavage methods to produce fragments of naturally occurring

Advantageously, the crystallizable compositions provided by this
invention are amenable to x-ray crystallography. Thus, this invention
also provides the three-dimensional structure of the AR-LBD/AR-LBD
ligand complex, particularly the complex of rat AR-LBD with
dihydrotestosterone.

The three-dimensional structure of the AR-LBD/dihydrotestosterone
complex of this invention is defined by a set of
structure coordinates as set forth in Table A. The term "structure
coordinates" refers to Cartesian coordinates derived from mathematical
equations related to the patterns obtained on diffraction of a
monochromatic beam of X-rays by the atoms (scattering centers) of an
androgen receptor/dihydrotestosterone complex in crystal form. The
diffraction data are used to calculate an electron density map of the
repeating unit of the crystal. The electron density maps are then used to
establish the positions of the individual atoms of the complex.

Those of skill in the art will understand that a set of structure
coordinates for a receptor or receptor/ligand complex, or a portion
thereof, is a relative set of points that define a shape in three
dimensions. Thus, it is possible that an entirely different set of
coordinates could define a similar or identical shape. Moreover, slight
variations in the individual coordinates will have little effect on overall
shape.

The variations in coordinates discussed above may be generated
because of mathematical manipulations of the structure coordinates. For
example, the structure coordinates set forth in Table A could be
manipulated by crystallographic permutations of the structure
coordinates, fractionalization of the structure coordinates, integer
additions or subtractions to sets of the structure coordinates, inversion
of the structure coordinates or any combination of the above.
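
The point that such manipulations leave the shape unchanged can be illustrated by comparing interatomic distances before and after a transformation. The sketch below is an illustration under stated assumptions: the coordinates are made up (they are not taken from Table A), and the helper names are hypothetical. Note that inversion preserves all distances but reverses handedness.

```python
import math
from itertools import combinations


def pairwise_distances(coords):
    """All interatomic distances, in a fixed pair order."""
    return [math.dist(p, q) for p, q in combinations(coords, 2)]


def translate(coords, shift):
    """Add a constant offset to every coordinate."""
    return [tuple(x + s for x, s in zip(p, shift)) for p in coords]


def invert(coords):
    """Invert through the origin (distances kept, handedness reversed)."""
    return [tuple(-x for x in p) for p in coords]


def permute_axes(coords, order=(1, 2, 0)):
    """Relabel the axes, e.g. (x, y, z) -> (y, z, x)."""
    return [tuple(p[i] for i in order) for p in coords]


# Hypothetical coordinates in angstroms, for illustration only.
atoms = [(1.0, 2.0, 3.0), (4.5, 0.5, 2.0), (0.0, 7.0, 1.5), (3.0, 3.0, 9.0)]
reference = pairwise_distances(atoms)

for transformed in (translate(atoms, (10.0, -5.0, 2.5)), invert(atoms), permute_axes(atoms)):
    same = all(
        math.isclose(d0, d1, abs_tol=1e-9)
        for d0, d1 in zip(reference, pairwise_distances(transformed))
    )
    print(same)
```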

Alternatively, modifications in the crystal structure due to
mutations, additions, substitutions, and/or deletions of amino acids, or
other changes in any of the components that make up the crystal could
also account for variations in structure coordinates. If such variations
are within an acceptable standard error as compared to the original
coordinates, the resulting three-dimensional shape is considered to be
the same.

Various computational analyses are therefore necessary to
determine whether a molecule or molecular complex or a portion thereof
is sufficiently similar to all or parts of the androgen
receptor/dihydrotestosterone complex described above as to be considered
the same. Such analyses may be carried out in current software
applications, such as the Molecular Similarity application of QUANTA
(Molecular Simulations Inc., San Diego, CA) version 4.1, and as
described in the accompanying User's Guide.

The Molecular Similarity application permits comparisons
between different structures, different conformations of the same
structure, and different parts of the same structure. The procedure used
in Molecular Similarity to compare structures is divided into four steps:
1) load the structures to be compared; 2) define the atom equivalences in
these structures; 3) perform a fitting operation; and 4) analyze the
results.

Each structure is identified by a name. One structure is
identified as the target (i.e., the fixed structure); all remaining structures
are working structures (i.e., moving structures). Since atom equivalency
within QUANTA is defined by user input, for the purpose of this
invention we will define equivalent atoms as protein backbone atoms (N,
Cα, C and O) for all conserved residues between the two structures being
compared. We will also consider only rigid fitting operations.

When a rigid fitting method is used, the working structure is
translated and rotated to obtain an optimum fit with the target
structure. The fitting operation uses an algorithm that computes the
optimum translation and rotation to be applied to the moving structure,
such that the root mean square difference of the fit over the specified
pairs of equivalent atoms is an absolute minimum. This number, given in
angstroms, is reported by QUANTA.
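
The rigid fitting operation described above can be sketched with the Kabsch algorithm, a standard least-squares superposition method. This is not QUANTA's implementation, whose internals are not described here; it is a minimal illustration that assumes NumPy is available and that the two inputs already list equivalent atoms in the same order. The function name is hypothetical.

```python
import numpy as np


def superposed_rmsd(moving, target):
    """Minimum RMSD (same units as the input) after optimal rigid superposition.

    moving, target: (N, 3) arrays of equivalent atoms in matching order.
    """
    P = np.asarray(moving, dtype=float)
    Q = np.asarray(target, dtype=float)
    if P.shape != Q.shape or P.ndim != 2 or P.shape[1] != 3:
        raise ValueError("inputs must both have shape (N, 3)")

    # Optimal translation: bring both centroids to the origin.
    Pc = P - P.mean(axis=0)
    Qc = Q - Q.mean(axis=0)

    # Optimal rotation from the SVD of the covariance matrix (Kabsch).
    H = Pc.T @ Qc
    U, _, Vt = np.linalg.svd(H)
    # Guard against an improper rotation (reflection).
    d = 1.0 if np.linalg.det(Vt.T @ U.T) >= 0.0 else -1.0
    R = Vt.T @ np.diag([1.0, 1.0, d]) @ U.T

    fitted = Pc @ R.T  # rotate the moving structure onto the target
    return float(np.sqrt(((fitted - Qc) ** 2).sum() / len(P)))
```

Applied to the backbone atoms (N, Cα, C, O) of conserved residues, the returned value is the number that would be compared against the thresholds given in the text.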

For the purpose of this invention, any molecule or molecular
complex that has a root mean square deviation of conserved residue
backbone atoms (N, Cα, C, O) of less than 1.5 Å when superimposed on
the relevant backbone atoms described by structure coordinates listed in
Table A is considered identical. More preferably, the root mean square
deviation is less than 1.0 Å. In a preferred embodiment of the present
invention, the molecule or molecular complex comprises at least a
portion of the ligand binding site defined by structure coordinates of
AR-LBD amino acids V685, L700, L701, S702, S703, L704, N705, E706,
L707, G708, E709, Q711, A735, I737, Q738, Y739, S740, W741, M742,
G743, L744, M745, V746, F747, A748, M749, G750, R752, Y763, F764,
A765, L768, F770, M780, M787, I869, L873, H874, F876, T877 and
F878 according to Table A, or a mutant or homologue of said molecule or
molecular complex. More preferred are molecules or molecular
complexes comprising all or any part of the ligand binding site defined by
structure coordinates of AR-LBD amino acids N705, Q711, R752, F764
and T877 according to Table A, or a mutant or homologue of said
molecule or molecular complex. Since the protein sequences for rat and
human AR LBD are identical, the human numbering system has been
used herein.

The term "complex" or "molecular complex" means AR-LBD or a
mutant or homologue of AR-LBD in a covalent or non-covalent
association with a chemical entity or compound.

For purposes of the present invention, by "at least a portion of"
it is meant all or any part of the ligand binding site defined by these
structure coordinates.

By "mutant or homologue" as used herein it is meant a molecule
or molecular complex having a similar structure and/or sequence to
AR-LBD. By "similar structure" it is meant a mutant or homologue
having a binding pocket that has a root mean square deviation from the
backbone atoms of said AR-LBD amino acids of not more than 1.5
Angstroms. By "similar sequence" it is meant a mutant or homologue
having 30%, or more preferably 75%, identity with AR-LBD.
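
The sequence criterion can be illustrated with a simple percent-identity calculation over a pre-computed alignment. This is a sketch under stated assumptions: the text does not specify how identity is to be calculated, so the convention used here (identical positions divided by aligned, non-gap positions) and the function name are assumptions, and the example strings are hypothetical.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity over aligned positions where neither sequence has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for res_a, res_b in zip(aligned_a, aligned_b):
        if res_a == gap or res_b == gap:
            continue  # skip gapped columns (an assumed convention)
        compared += 1
        if res_a == res_b:
            matches += 1
    if compared == 0:
        raise ValueError("no aligned residue pairs to compare")
    return 100.0 * matches / compared


# Hypothetical aligned fragments in one-letter code, for illustration only.
print(percent_identity("ACDE", "ACDF"))  # 75.0
```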

The term "root mean square deviation" means the square root of
the arithmetic mean of the squares of the deviations from the mean. It is
a way to express the deviation or variation from a trend or object. For
purposes of this invention, the "root mean square deviation" defines the
variation in the backbone of a protein or protein complex from the
relevant portion of the backbone of the AR portion of the complex as
defined by the structure coordinates described herein.

Once the structure coordinates of a protein crystal have been
determined, they are useful in solving the structures of other crystals.
Thus, in accordance with the present invention, the structure
coordinates of an androgen receptor/dihydrotestosterone complex, and
portions thereof, are stored in a machine-readable storage medium.
Such data may be used for a variety of
purposes, such as drug discovery and x-ray crystallographic analysis of
protein crystals.

Accordingly, in one embodiment of this invention is provided a
machine-readable data storage medium comprising a data storage
material encoded with the structure coordinates set forth in Table A.
One embodiment utilizes System 10 as disclosed in WO 98/11134,
the disclosure of which is incorporated herein by reference in
its entirety.

For the first time, the present invention permits the use of
structure-based or rational drug design techniques to design, select, and
synthesize chemical entities, including inhibitory and stimulatory
compounds that are capable of binding to AR-LBD, or any portion
thereof.

One particularly useful drug design technique enabled by this
invention is iterative drug design. Iterative drug design is a method for
optimizing associations between a protein and a compound by
determining and evaluating the three-dimensional structures of
successive sets of protein/compound complexes.

Those of skill in the art will realize that association of natural
ligands or substrates with the binding pockets of their corresponding
receptors or enzymes is the basis of many biological mechanisms of
action. The term "binding pocket" as used herein, refers to a region of a
molecule or molecular complex that, as a result of its shape, favorably
associates with another chemical entity or compound. Similarly, many
drugs exert their biological effects through association with the binding
pockets of receptors and enzymes. Such associations may occur with all
or any parts of the binding pockets. An understanding of such
associations will help lead to the design of drugs having more favorable
associations with their target receptor or enzyme, and thus, improved
biological effects. Therefore, this information is valuable in designing
potential ligands or inhibitors of receptors or enzymes, such as inhibitors
of AR.

The term "associating with" refers to a condition of proximity
between chemical entities or compounds, or portions thereof. The
association may be non-covalent, wherein the juxtaposition is
energetically favored by hydrogen bonding or van der Waals or
electrostatic interactions, or it may be covalent.

In iterative drug design, crystals of a series of
protein/compound complexes are obtained and then the
three-dimensional structure of each complex is solved. Such an approach
provides insight into the association between the proteins and
compounds of each complex. This is accomplished by selecting
compounds with inhibitory activity, obtaining crystals of this new
protein/compound complex, solving the three-dimensional structure of
the complex, and comparing the associations between the new
protein/compound complex and previously solved protein/compound
complexes. By observing how changes in the compound affected the
protein/compound associations, these associations may be optimized.

In some cases, iterative drug design is carried out by forming
successive protein-compound complexes and then crystallizing each new
complex. Alternatively, a pre-formed protein crystal is soaked in the
presence of an inhibitor, thereby forming a protein/compound complex
and obviating the need to crystallize each individual protein/compound
complex.

As used herein, the term "soaked" refers to a process in which 
the crystal is transferred to a solution containing the compound of 
interest. 

The structure coordinates set forth in Table A can also be used 
to aid in obtaining structural information about another crystallized 
molecule or molecular complex. This may be achieved by any of a 
number of well-known techniques, including molecular replacement. 

The structure coordinates set forth in Table A can also be used 
for determining at least a portion of the three-dimensional structure of 
molecules or molecular complexes which contain at least some 
structurally similar features to AR. In particular, structural information 
about another crystallized molecule or molecular complex may be 
obtained. This may be achieved by any of a number of well-known 
techniques, including molecular replacement. 

Therefore, in another embodiment this invention provides a 
method of utilizing molecular replacement to obtain structural 
information about a crystallized molecule or molecular complex whose 
structure is unknown comprising the steps of: 

a) generating an X-ray diffraction pattern from said crystallized molecule 
or molecular complex; 

b) applying at least a portion of the structure coordinates set forth in 
Table A to the X-ray diffraction pattern to generate a three-dimensional 
electron density map of the molecule or molecular complex whose 
structure is unknown; and 

c) using all or a portion of the structure coordinates set forth in Table A 
to generate homology models of AR-LBD or any other nuclear hormone 
receptor ligand binding domain. 

Preferably, the crystallized molecule or molecular complex is
obtained by soaking a crystal of this invention in a solution.

By using molecular replacement, all or part of the structure
coordinates of the AR-LBD/AR-LBD ligand complex provided by this
invention can be used to determine the structure of a crystallized
molecule or molecular complex whose structure is unknown more
quickly and efficiently than attempting to determine such information ab
initio.

Molecular replacement provides an accurate estimation of the
phases for an unknown structure. Phases are a factor in equations used
to solve crystal structures that cannot be determined directly. Obtaining
accurate values for the phases, by methods other than molecular 
replacement, is a time-consuming process that involves iterative cycles of 
approximations and refinements and greatly hinders the solution of 
crystal structures. However, when the crystal structure of a protein 
containing at least a homologous portion has been solved, the phases 
from the known structure provide a satisfactory estimate of the phases 
for the unknown structure. 

Thus, this method involves generating a preliminary model of a
molecule or molecular complex whose structure coordinates are
unknown, by orienting and positioning the relevant portion of the
AR-LBD/AR-LBD ligand complex according to Table A within the unit cell of
the crystal of the unknown molecule or molecular complex so as best to
account for the observed X-ray diffraction pattern of the crystal of the
molecule or molecular complex whose structure is unknown. Phases can
then be calculated from this model and combined with the observed
X-ray diffraction pattern amplitudes to generate an electron density map of
the structure whose coordinates are unknown. This, in turn, can be
subjected to any well-known model building and structure refinement
techniques to provide a final, accurate structure of the unknown
crystallized molecule or molecular complex [E. Lattman, "Use of the
Rotation and Translation Functions", in Meth. Enzymol., 115, pp. 55-77
(1985); M. G. Rossmann, ed., "The Molecular Replacement Method", Int.
Sci. Rev. Ser., No. 13, Gordon & Breach, New York (1972)].
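
The step of calculating phases from the placed model and combining them with observed amplitudes can be sketched as follows. This is a deliberately simplified illustration, not a molecular replacement program: it assumes the search model has already been oriented and positioned, treats atoms as points with unit scattering factors, and ignores symmetry, temperature factors and scaling. All names and numerical values are hypothetical.

```python
import cmath
import math


def calc_structure_factor(hkl, frac_coords):
    """F_calc(hkl) for point atoms with unit scattering factors."""
    h, k, l = hkl
    return sum(
        cmath.exp(2j * math.pi * (h * x + k * y + l * z))
        for x, y, z in frac_coords
    )


def phased_coefficients(observed_amplitudes, model_frac_coords):
    """Combine observed amplitudes with phases calculated from the model.

    observed_amplitudes: dict mapping (h, k, l) -> |F_obs|
    Returns a dict mapping (h, k, l) -> |F_obs| * exp(i * phi_calc),
    i.e. the Fourier coefficients of an electron density map.
    """
    coefficients = {}
    for hkl, f_obs in observed_amplitudes.items():
        phi_calc = cmath.phase(calc_structure_factor(hkl, model_frac_coords))
        coefficients[hkl] = cmath.rect(f_obs, phi_calc)
    return coefficients


# Hypothetical one-atom model and a single made-up observed amplitude.
model = [(0.25, 0.0, 0.0)]
coeffs = phased_coefficients({(1, 0, 0): 10.0}, model)
print(abs(coeffs[(1, 0, 0)]))  # amplitude is preserved; only the phase comes from the model
```

An inverse Fourier transform of such coefficients would give the electron density map referred to in the text.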

The structure of any portion of any crystallized molecule or
molecular complex, or mutant, homologue or orphan receptor that is
sufficiently homologous to any portion of the AR-LBD/AR-LBD ligand
complex can be solved by this method. Along with the aforementioned
AR, there also exist a number of AR for which the activating or
deactivating ligands may not be characterized. These proteins are
classified as AR due to strong sequence homology to other AR, and are
known as orphan receptors.

The structure coordinates are also particularly useful to solve
the structure of crystals of AR-LBD/AR-LBD ligand co-complexed with a
variety of chemical entities. This approach enables the determination of
the optimal sites for interaction between chemical entities, including
interaction of candidate AR inhibitors with the complex. For example,
high resolution X-ray diffraction data collected from crystals exposed to
different types of solvent allows the determination of where each type of
solvent molecule resides. Small molecules that bind tightly to these sites
can then be designed and synthesized and tested for their AR inhibition
activity.

All of the complexes referred to above may be studied using
well-known X-ray diffraction techniques and may be refined versus 1.5-3 Å
resolution X-ray data to an R value of about 0.20 or less using computer
software, such as X-PLOR (Yale University, 1992, distributed by
Molecular Simulations, Inc.; see, e.g., Blundell & Johnson, supra; Meth.
Enzymol., vols. 114 & 115, H. W. Wyckoff et al., eds., Academic Press
(1985)). This information may thus be used to optimize known AR
agonists, partial agonists, antagonists, partial antagonists and SARMs,
and more importantly, to design new AR agonists/antagonists.
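
The R value mentioned above is conventionally the sum of the absolute differences between observed and calculated structure factor amplitudes divided by the sum of the observed amplitudes. The sketch below illustrates that definition only; it assumes the calculated amplitudes have already been placed on the same scale as the observed ones, and the numbers are made up.

```python
def r_factor(f_obs, f_calc):
    """Crystallographic R value: sum(| |Fo| - |Fc| |) / sum(|Fo|)."""
    if len(f_obs) != len(f_calc):
        raise ValueError("amplitude lists must have equal length")
    numerator = sum(abs(abs(fo) - abs(fc)) for fo, fc in zip(f_obs, f_calc))
    denominator = sum(abs(fo) for fo in f_obs)
    if denominator == 0:
        raise ValueError("observed amplitudes sum to zero")
    return numerator / denominator


# Hypothetical amplitudes, for illustration only: (10 + 5 + 5) / 175.
print(round(r_factor([100.0, 50.0, 25.0], [90.0, 55.0, 30.0]), 3))  # 0.114
```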

Accordingly, the present invention is also directed to a binding
site in AR-LBD for an AR-LBD ligand in which a portion of the AR-LBD
ligand is in van der Waals contact or hydrogen bonding contact with at
least one of the following residues: V685, L700, L701, S702, S703, L704,
N705, E706, L707, G708, E709, Q711, A735, I737, Q738, Y739, S740,
W741, M742, G743, L744, M745, V746, F747, A748, M749, G750, R752,
Y763, F764, A765, L768, F770, M780, M787, I869, L873, H874, F876,
T877, F878, L880, L881, V889, F891, P892, E893, M894, M895, A896,
E897, I898, I899, S900, V901, Q902, V903, P904 or I906 of AR-LBD.
For purposes of this invention, by AR-LBD binding site it is also meant to
include mutants or homologues thereof. In a preferred embodiment, the
mutants or homologues have at least 25% identity, more preferably 50%
identity, more preferably 75% identity, and most preferably 95% identity
to residues V685, L700, L701, S702, S703, L704, N705, E706, L707,
G708, E709, Q711, A735, I737, Q738, Y739, S740, W741, M742, G743,
L744, M745, V746, F747, A748, M749, G750, R752, Y763, F764, A765,
L768, F770, M780, M787, I869, L873, H874, F876, T877, F878, L880,
L881, V889, F891, P892, E893, M894, M895, A896, E897, I898, I899,
S900, V901, Q902, V903, P904 or I906 of AR-LBD binding sites.
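
Residues in contact with a bound ligand can be identified from a set of coordinates with a simple distance test. The following is a sketch under stated assumptions: the 4.0 Å cutoff is an illustrative choice that the text does not specify, the coordinates are made up rather than taken from Table A, and the function name is hypothetical. A distance test alone does not distinguish van der Waals contacts from hydrogen bonds.

```python
import math


def contact_residues(protein_atoms, ligand_atoms, cutoff=4.0):
    """Return sorted labels of residues with any atom within cutoff of the ligand.

    protein_atoms: iterable of (residue_label, (x, y, z)) in angstroms
    ligand_atoms:  iterable of (x, y, z) in angstroms
    cutoff:        contact distance in angstroms (assumed value)
    """
    ligand_atoms = list(ligand_atoms)
    found = set()
    for residue, xyz in protein_atoms:
        if any(math.dist(xyz, lig_xyz) <= cutoff for lig_xyz in ligand_atoms):
            found.add(residue)
    return sorted(found)


# Hypothetical coordinates, for illustration only.
protein = [
    ("N705", (0.0, 0.0, 0.0)),
    ("R752", (3.0, 0.0, 0.0)),
    ("L880", (20.0, 0.0, 0.0)),
]
ligand = [(1.0, 0.0, 0.0)]
print(contact_residues(protein, ligand))  # ['N705', 'R752']
```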

The present invention is also directed to a machine-readable
data storage medium comprising a data storage material encoded with
machine-readable data, wherein the data is defined by the structure
coordinates of an AR-LBD/AR-LBD ligand complex according to Table A
or a homologue of said complex, wherein said homologue comprises
backbone atoms that have a root mean square deviation from the
backbone atoms of the complex of not more than 3.0 Å. Preferably, the
machine-readable data storage medium according to the invention is
one wherein said molecule or molecular complex is defined by the set of
structure coordinates for the AR-LBD/AR-LBD ligand complex according
to Table A, or a homologue of said molecule or molecular complex, said
homologue having a root mean square deviation from the backbone atoms
of said amino acids of not more than 2.0 Å. In a preferred embodiment the
machine-readable data storage medium comprises a data storage
material encoded with a first set of machine-readable data comprising a
Fourier transform of at least a portion of the structure coordinates for
an AR-LBD/AR-LBD ligand complex according to Table A, which, when
combined with a second set of machine-readable data comprising an X-ray
diffraction pattern of a molecule or molecular complex of unknown
structure, using a machine programmed with instructions for using said
first set of data and said second set of data, can determine at least a
portion of the structure coordinates corresponding to the second set of
machine-readable data.

The present invention also provides for computational methods
using three-dimensional models of the androgen receptor that are based
on crystals of the AR-LBD/AR-LBD ligand complex. Generally, the
computational method of designing an androgen receptor ligand
determines which amino acid or amino acids of the AR-LBD interact with
at least one chemical moiety of the ligand, using a three-dimensional
model of a crystallized protein comprising the AR-LBD with a bound
ligand, and selects at least one chemical modification of the
chemical moiety to produce a second chemical moiety with a structure
that either decreases or increases an interaction between the interacting
amino acid and the second chemical moiety compared to the interaction
between the interacting amino acid and the corresponding chemical
moiety on the natural hormone.

The computational methods of the present invention are for
designing androgen receptor synthetic ligands using such crystal and
three-dimensional structural information to generate synthetic ligands
that modulate the conformational changes of the androgen receptor's
LBD. These computational methods are particularly useful in designing
an agonist, partial agonist, antagonist or partial antagonist or SARMs to
the androgen receptor, wherein the agonist, partial agonist, antagonist or
partial antagonist or SARMs has an extended moiety that prevents any
one of a number of ligand-induced molecular events that alter the
receptor's influence on the regulation of gene expression, such as
preventing the normal coordination of the activation domain observed for
a naturally occurring ligand or other ligands that mimic the naturally
occurring ligand, such as an agonist. As described herein, synthetic
ligands of the androgen receptor will be useful in modulating androgen
receptor activity in a variety of medical conditions.

AR is known to comprise various domains as follows: 

1) a variable amino-terminal domain; 

2) a highly conserved DNA-binding domain (DBD); and 

3) a less conserved carboxyl-terminal ligand-binding domain (LBD). 

This modularity permits different domains of each protein to separately
accomplish different functions, although the domains can influence each
other. The separate function of a domain is usually preserved when a
particular domain is isolated from the remainder of the protein. Using
conventional protein chemistry techniques a modular domain can
sometimes be separated from the parent protein. Using conventional
molecular biology techniques each domain can usually be separately
expressed with its original function intact, or chimeras of two different
nuclear receptors can be constructed, wherein the chimeras retain the
properties of the individual functional domains of the respective nuclear
receptors from which the chimeras were generated.
Amino Terminal Domain

The amino terminal domain is the least conserved of the three
domains. This domain is involved in transcriptional activation and in
some cases its uniqueness may dictate selective receptor-DNA binding
and activation of target genes by specific receptor isoforms. This domain
can display synergistic and antagonistic interactions with the domains of
the LBD. For example, studies with mutated and/or deleted receptors
show positive cooperativity of the amino and carboxy terminal domains.
In some cases, deletion of either of these domains will abolish the
receptor's transcriptional activation functions.
DNA-Binding Domain

The DBD is the most conserved domain. The DBD contains two
perpendicularly oriented α-helices that extend from the base of the first
and second zinc fingers. The two zinc fingers function in concert along
with non-zinc finger residues to direct nuclear receptors to specific target
sites on DNA and to align receptor homodimer or heterodimer interfaces.
Various amino acids in the DBD influence spacing between two half-sites
for receptor dimer binding.
Ligand or AR Binding Domain 

The LBD is the second most highly conserved domain. Whereas
integrity of several different LBD sub-domains is important for ligand
binding, truncated molecules containing only the LBD retain normal
ligand-binding activity. This domain also participates in other functions,
including dimerization, nuclear translocation and transcriptional
activation. Importantly, this domain is the binding site for ligands, i.e.
AR modulators, and undergoes ligand-induced conformational changes
as detailed herein.

As described herein, the LBD of AR can be expressed,
crystallized, its three dimensional structure determined with a ligand
bound (either using crystal data from the same receptor or a different
receptor or a combination thereof), and computational methods used to
design ligands to its LBD, particularly ligands that contain an extension
moiety that coordinates the activation domain of AR.

Once a computationally designed ligand (CDL) is synthesized, it
can be tested using assays to establish its activity as an agonist, partial
agonist, antagonist, partial antagonist or SARM, and its affinity, as
described herein. After such testing, the CDLs can be further refined by
generating LBD crystals with a CDL bound to the LBD. The structure of
the CDL can then be further refined using the chemical modification
methods described herein for three dimensional models to improve the
activity or affinity of the CDL and make second generation CDLs with
improved properties, such as that of a super agonist or antagonist.

Typically AR-LBD is purified to homogeneity for crystallization.
Purity of AR-LBD is measured with SDS-PAGE, mass spectrometry and
hydrophobic HPLC. The purified AR for crystallization should be at least
97.5% pure or 97.5% pure, preferably at least 99.0% pure or 99.0% pure,
more preferably at least 99.5% pure or 99.5% pure.

Initial purification of the unliganded receptor can be obtained
by conventional techniques, such as hydrophobic interaction
chromatography (HPLC), ion exchange chromatography (HPLC), and
heparin affinity chromatography.

To achieve higher purification for improved crystals of AR, it will
be desirable to ligand shift purify the nuclear receptor using a column
that separates the receptor according to charge, such as an ion exchange
or hydrophobic interaction column, and then bind the eluted receptor
with a ligand, especially an agonist or partial agonist. The ligand induces
a change in the receptor's surface charge such that, when
re-chromatographed on the same column, the receptor elutes at the
position of the liganded receptor; impurities that would elute at that
position were already removed by the original column run with the
unliganded receptor. Usually saturating concentrations of ligand are
used in the column and the protein can be preincubated with the ligand
prior to passing it over the column.

More recently developed methods involve engineering a "tag",
such as a histidine tag, placed on the end of the protein, such as on the
amino terminus, and then using a nickel chelation column for
purification (Janknecht R., Proc. Natl. Acad. Sci. USA Vol 88:8972-8976
(1991), incorporated by reference).

To determine the three dimensional structure of an AR-LBD, it is
desirable to co-crystallize the LBD with a corresponding LBD ligand.

Typically purified AR-LBD is equilibrated at a saturating
concentration of ligand at a temperature that preserves the integrity of
the protein. Ligand equilibration can be established between 2 and 37°C,
although the receptor tends to be more stable in the 2-20°C range.



Preferably crystals are made with the hanging drop method.
Regulated temperature control is desirable to improve crystal stability 
and quality. Temperatures between 4 and 25°C are generally used and it 
is often preferable to test crystallization over a range of temperatures. It 
is preferable to use crystallization temperatures from 18 to 25°C, more 
preferably 20 to 23°C, and most preferably 22°C. 

Ligands that interact with AR can act as an agonist, partial 
agonist, antagonist or partial antagonist or SARM based on what ligand- 
induced conformational changes take place. 

Agonists or partial agonists induce changes in receptors that 
place them in an active conformation that allows them to influence 
transcription, either positively or negatively. There may be several 
different ligand-induced changes in the receptor's conformation. 

Antagonists or partial antagonists bind to receptors, but fail to
induce conformational changes that alter the receptor's transcriptional
regulatory properties or physiologically relevant conformations. Binding
of an antagonist or partial antagonist can also block the binding and
therefore the actions of an agonist or partial agonist.

Partial agonists, or partial antagonists, bind to receptors and 
induce only part of the changes in the receptors that are induced by 
agonists or antagonists, respectively. The differences can be qualitative 
or quantitative. Thus, a partial agonist or partial antagonist may induce 
some of the conformation changes induced by agonists or antagonists, 
respectively, but not others, or it may only induce certain changes to a 
limited extent. 

As described herein, the unliganded receptor is in a
configuration that is either inactive, has some activity or has repressor
activity. Binding of agonist ligands induces conformational changes in
the receptor such that the receptor becomes more active, either to
stimulate or repress the expression of genes. The receptors may also
have non-genomic actions. Some of the known types of changes and/or
the sequelae of these are listed herein.

Heat shock protein binding domains present a region for binding
to the LBD and can be modulated by the binding of a ligand to the LBD.
Consequently, an extended chemical moiety (or more) from the ligand
that stabilizes the binding or contact of the heat shock protein binding
domain with the LBD can be designed. Typically such chemical
moieties will extend past and away from the molecular recognition
domain on the ligand and usually past the buried binding cavity of the
ligand.

Ligand binding by the receptor is a dynamic process, which 
regulates receptor function by inducing an altered conformation. 

The three-dimensional structure of the liganded AR will
greatly aid in the development of new AR synthetic ligands. In addition,
AR is overall well suited to modern methods including three-dimensional
structure elucidation and combinatorial chemistry such as those
disclosed in EP 335 628 and U.S. patent 5,463,564, which are incorporated
herein by reference. Computer programs that use crystallography data
when practicing the present invention will enable the rational design of
ligands to AR. Programs such as RASMOL can be used with the atomic
coordinates from crystals generated by practicing the invention or used
to practice the invention by generating three dimensional models and/or
determining the structures involved in ligand binding. Computer
programs such as INSIGHT and GRASP allow for further manipulation
and the ability to introduce new structures. In addition, high throughput
binding and bioactivity assays can be devised using purified recombinant
protein and modern reporter gene transcription assays described herein
and known in the art in order to refine the activity of a CDL.

Generally the computational method of designing an AR 
synthetic ligand comprises two steps: 

1) determining which amino acid or amino acids of AR-LBD interact
with a first chemical moiety (at least one) of the ligand using a three
dimensional model of a crystallized protein comprising an AR-LBD with a
bound ligand; and

2) selecting a chemical modification (at least one) of the first chemical
moiety to produce a second chemical moiety with a structure to either
decrease or increase an interaction between the interacting amino acid
and the second chemical moiety compared to the interaction between the
interacting amino acid and the first chemical moiety.
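
Step 1 can be illustrated with a minimal sketch that measures, for each residue, the shortest atom-to-atom distance to the ligand and then sorts residues into contacting and distant groups. The coordinates, atom names and the residue K905 below are hypothetical toy values chosen for illustration, not taken from Table A; the 4.0 and 4.5 angstrom thresholds follow the contact and distant-residue distances stated elsewhere in this specification.

```python
import math

# Hypothetical toy coordinates in angstroms (NOT from Table A).
# Each protein atom: (residue label, atom name, (x, y, z)).
protein_atoms = [
    ("N705", "ND2", (2.9, 0.0, 0.0)),
    ("R752", "NH2", (0.0, 3.0, 0.0)),
    ("T877", "OG1", (0.0, 0.0, 3.5)),
    ("K905", "NZ", (9.0, 0.0, 0.0)),
]
# Each ligand atom: (atom name, (x, y, z)).
ligand_atoms = [("O3", (0.0, 0.0, 0.0)), ("O17", (3.0, 0.0, 3.5))]


def min_distances(protein, ligand):
    """Return {residue: shortest atom-to-atom distance to the ligand}."""
    shortest = {}
    for residue, _atom, p_xyz in protein:
        for _name, l_xyz in ligand:
            d = math.dist(p_xyz, l_xyz)
            if d < shortest.get(residue, float("inf")):
                shortest[residue] = d
    return shortest


def classify(distances, contact_max=4.0, distant_min=4.5):
    """Split residues into interacting (<= contact_max) and distant (>= distant_min)."""
    contacts = sorted(r for r, d in distances.items() if d <= contact_max)
    distant = sorted(r for r, d in distances.items() if d >= distant_min)
    return contacts, distant


distances = min_distances(protein_atoms, ligand_atoms)
contacts, distant = classify(distances)
print("interacting:", contacts)
print("distant:", distant)
```

In practice the atom lists would be read from the structure coordinates, and step 2 would be evaluated by rerunning the same measurement on a model of the modified ligand.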

Preferably the method is carried out wherein said three dimensional
model is generated by comparing isomorphous ligand derivatives to
produce improved phasing. Further preferred is wherein said method
comprises determining a change in interaction between said interacting
amino acid and said ligand after chemical modification of said first
chemical moiety, especially wherein said three dimensional model is
generated by comparing isomorphous ligand derivatives to produce
improved phasing. Also preferred is wherein said selecting uses said
first chemical moiety that interacts with at least one of the interacting
amino acids V685, L700, L701, S702, S703, L704, N705, E706, L707,
G708, E709, Q711, A735, I737, Q738, Y739, S740, W741, M742, G743,
L744, M745, V746, F747, A748, M749, G750, R752, Y763, F764, A765,
L768, F770, M780, M787, I869, L873, H874, F876, T877, F878, L880,
L881, V889, F891, P892, E893, M894, M895, A896, E897, I898, I899,
S900, V901, Q902, V903, P904 or I906.

As shown herein, interacting amino acids form contacts with the 
ligand and the center of the atoms of the interacting amino acids are 
usually 2 to 4 angstroms away from the center of the atoms of the 
ligand. Generally these distances are determined by computer as 
discussed herein and in McRee 1993, however distances can be 
determined manually once the three dimensional model is made. See 
also Wagner et al., Nature 378(6558):670-697 (1995) for stereochemical
figures of three dimensional models. More commonly, the atoms of the
ligand and the atoms of interacting amino acids are 3 to 4 angstroms
apart. The invention can be practiced by repeating steps 1 and 2 to refine
the fit of the ligand to the LBD and to determine a better ligand, such as 
an agonist, partial agonist, antagonist or partial antagonist or SARM. 
The three dimensional model of AR can be represented in two 
dimensions to determine which amino acids contact the ligand and to 
select a position on the ligand for chemical modification and changing 
the interaction with a particular amino acid compared to that before 
chemical modification. The chemical modification may be made using a 
computer, manually using a two dimensional representation of the three 
dimensional model or by chemically synthesizing the ligand. The ligand 
can also interact with distant amino acids after chemical modification of 
the ligand to create a new ligand. Distant amino acids are generally not 
in contact with the ligand before chemical modification. A chemical 
modification can change the structure of the ligand to make a new
ligand that interacts with a distant amino acid usually at least 4.5
angstroms away from the ligand, preferably wherein said first chemical
moiety is 6 to 12 angstroms away from a distant amino acid. Often
distant amino acids will not line the surface of the binding cavity for the
ligand, as they are too far away from the ligand to be part of a pocket or
binding cavity. The interaction between an LBD amino acid and an atom
of an LBD ligand can be made by any force or attraction described in
nature. Usually the interaction between the atom of the amino acid and
the ligand will be the result of a hydrogen bonding interaction, charge
interaction, hydrophobic interaction, van der Waals interaction or dipole
interaction. In the case of the hydrophobic interaction it is recognized
that this is not a per se interaction between the amino acid and ligand,
but rather the usual result, in part, of the repulsion of water or other
hydrophilic group from a hydrophobic surface. Reducing or enhancing
the interaction of the LBD and a ligand can be measured by calculating
or testing binding energies, computationally or using thermodynamic or
kinetic methods as known in the art.
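
As one hedged illustration of a thermodynamic comparison, the standard relation between binding free energy and the dissociation constant, ΔG = RT ln(Kd), can be used to express how much a chemical modification enhances or reduces binding. The Kd values below are hypothetical and serve only to show the arithmetic; this specification does not report them.

```python
import math

R_KCAL = 1.987204e-3  # gas constant in kcal/(mol*K)


def binding_free_energy(kd_molar, temperature_k=298.15):
    """Standard binding free energy (kcal/mol) from a dissociation constant in mol/L."""
    return R_KCAL * temperature_k * math.log(kd_molar)


# Hypothetical dissociation constants, for illustration only.
dg_first = binding_free_energy(10e-9)   # ligand with the first chemical moiety, 10 nM
dg_second = binding_free_energy(1e-9)   # ligand with the second chemical moiety, 1 nM

# A negative difference means the modification enhanced the interaction.
ddg = dg_second - dg_first
print(f"dG first  = {dg_first:.2f} kcal/mol")
print(f"dG second = {dg_second:.2f} kcal/mol")
print(f"ddG       = {ddg:.2f} kcal/mol")
```

A tenfold improvement in Kd corresponds to roughly 1.4 kcal/mol at room temperature, which gives a sense of scale when ranking candidate modifications.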

Chemical modifications will often enhance or reduce interactions
of an atom of an LBD amino acid and an atom of an LBD ligand. Steric
hindrance will be a common means of changing the interaction of the
LBD binding cavity with the activation domain.

The present invention also provides methods for identifying
compounds that modulate androgen receptor activity. Various methods
or combinations thereof can be used to identify these compounds. For
example, test compounds can be modeled that fit spatially into the
AR-LBD as defined by structure coordinates according to Table A, or using
a three-dimensional structural model of AR-LBD, mutant AR-LBD or
AR-LBD homolog or portion thereof. Structure coordinates of the ligand
binding site, in particular amino acids V685, L700, L701, S702, S703,
L704, N705, E706, L707, G708, E709, Q711, A735, I737, Q738, Y739,
S740, W741, M742, G743, L744, M745, V746, F747, A748, M749, G750,
R752, Y763, F764, A765, L768, F770, M780, M787, I869, L873, H874,
F876, T877, F878, L880, L881, V889, F891, P892, E893, M894, M895,
A896, E897, I898, I899, S900, V901, Q902, V903, P904 or I906 can also
be used to identify structural and chemical features. Identified
structural or chemical features can then be employed to design or select
compounds as potential AR modulators. By structural and chemical
features it is meant to include, but is not limited to, van der Waals
interactions, hydrogen bonding interactions, charge interaction,
hydrophobic bonding interaction, hydrophobic interaction and dipole
interaction. Alternatively, or in conjunction, the three-dimensional
structural model or the ligand binding site can be employed to design or
select compounds as potential AR modulators. Compounds identified as
potential AR modulators can then be synthesized and screened in an
assay characterized by binding of a test compound to the AR-LBD.
Examples of assays useful in screening of potential AR modulators
include, but are not limited to, screening in silico, in vitro assays and
high throughput assays. Finally, these methods may also involve
modifying or replacing one or more amino acids from AR-LBD such as
V685, L700, L701, S702, S703, L704, N705, E706, L707, G708, E709,
Q711, A735, I737, Q738, Y739, S740, W741, M742, G743, L744, M745,
V746, F747, A748, M749, G750, R752, Y763, F764, A765, L768, F770,
M780, M787, I869, L873, H874, F876, T877, F878, L880, L881, V889,
F891, P892, E893, M894, M895, A896, E897, I898, I899, S900, V901,
Q902, V903, P904 or I906 of AR-LBD according to Table A.

A preferred method of the invention can be described as a
computational method of designing an androgen receptor antagonist
from an androgen receptor agonist comprising:

1) determining a structure of a molecular recognition domain of
said agonist using a three dimensional model of a crystallized
protein comprising an AR-LBD, and

2) selecting at least one chemical modification of said agonist
that provides a ligand structure that extends beyond a
binding site for said agonist and in the direction of at least
one protein domain important in AR biological function.

Another preferred method of the invention can be described as a
computational method of designing a selective androgen receptor
modulator such as an androgen receptor super agonist or antagonist
comprising:

1) determining at least one interacting amino acid of an AR-LBD
that interacts with at least one first chemical moiety of said
ligand using a three dimensional model of a crystallized
protein comprising AR-LBD with a bound ligand, and

2) selecting at least one chemical modification of said first
chemical moiety to produce a second chemical moiety with a
structure to reduce or enhance an interaction between said
interacting amino acid and said second chemical moiety
compared to said interaction between said interacting amino
acid and said first chemical moiety.



However, as will be understood by those of skill in the art upon this
disclosure, other structure based design methods can be used. Various
computational structure based design methods have been disclosed in
the art.

For example, a number of computer modeling systems are
available in which the sequence of the AR-LBD and the AR-LBD
structure (i.e., atomic coordinates of AR-LBD and/or the atomic
coordinates of the active site, the bond and dihedral angles, and
distances between atoms in the active site such as provided in Table A)
can be input. This computer system then generates the structural
details of the site in which a potential AR modulator binds so that
complementary structural details of the potential modulators can be
determined. Design in these modeling systems is generally based upon
the compound being capable of physically and structurally associating
with AR-LBD. In addition, the compound must be able to assume a
conformation that allows it to associate with AR-LBD. Some modeling
systems estimate the potential inhibitory or binding effect of a potential
AR modulator prior to actual synthesis and testing.

Methods for screening chemical entities or fragments for their
ability to associate with AR-LBD are also well known. Often these
methods begin by visual inspection of the active site on the computer
screen. Selected fragments or chemical entities are then positioned with
the AR-LBD. Docking is accomplished using software such as QUANTA
and SYBYL, followed by energy minimization and molecular dynamics
with standard molecular mechanics force fields such as CHARMM and
AMBER. Examples of computer programs which assist in the selection
of chemical fragments or chemical entities useful in the present invention
include, but are not limited to, GRID (Goodford, P.J. J. Med. Chem.
1985 28:849-857), AUTODOCK (Goodsell, D.S. and Olson, A.J. Proteins:
Structure, Function, and Genetics 1990 8:195-202), and DOCK (Kuntz
et al. J. Mol. Biol. 1982 161:269-288).

Upon selection of preferred chemical entities or fragments, their
relationship to each other and AR-LBD can be visualized and the entities
or fragments can be assembled into a single potential modulator.
Programs useful in assembling the individual chemical entities include,
but are not limited to, CAVEAT (Bartlett et al. Molecular Recognition in
Chemical and Biological Problems Special Publication, Royal Chem. Soc.
78, 182-196 (1989)) and 3D Database systems (Martin, Y.C. J. Med.
Chem. 1992 35:2145-2154).

Alternatively, compounds may be designed de novo using either
an empty active site or optionally including some portion of a known
inhibitor. Methods of this type of design include, but are not limited to,
LUDI (Bohm H-J, J. Comp. Aid. Molec. Design 1992 6:61-78) and
LeapFrog (Tripos Associates, St. Louis, MO).

The present invention is also directed to an AR-LBD selective 
androgen receptor modulator (SARM), in particular an agonist or 
antagonist or partial agonist or partial antagonist, identified by a 
computational process of the invention. 

The present invention is further directed to a method for treating
prostate cancer comprising administering an effective amount of an AR
modulator, preferably an antagonist or partial antagonist, identified by a
computational process of the invention.

The present invention is also directed to a method for treating an
age related disease comprising administering an effective amount of an
AR modulator, preferably an agonist or partial agonist, identified by a
computational process of the invention, preferably wherein said age
related disease is osteoporosis, muscle wasting or loss of libido.

Compounds identified as agonists, partial agonists, antagonists,
partial antagonists or SARMs by the methods disclosed herein which are
active when given orally can be formulated as liquids, for example syrups,
suspensions or emulsions, tablets, capsules and lozenges. A liquid
composition will generally consist of a suspension or solution of the
compound in a suitable liquid carrier(s), for example ethanol, glycerin,
sorbitol, non-aqueous solvent such as polyethylene glycol, oils or water,
with a suspending agent, preservative, surfactant, wetting agent,
flavoring or coloring agent. Alternatively, a liquid formulation can be
prepared from a reconstitutable powder. For example a powder
containing active compound, suspending agent, sucrose and a sweetener
can be reconstituted with water to form a suspension; and a syrup can
be prepared from a powder containing active ingredient, sucrose and a
sweetener. A composition in the form of a tablet can be prepared using
any suitable pharmaceutical carrier(s) routinely used for preparing solid
compositions. Examples of such carriers include magnesium stearate,
starch, lactose, sucrose, microcrystalline cellulose and binders, for
example polyvinylpyrrolidone. The tablet can also be provided with a
color film coating, or color included as part of the carrier(s). In addition,
active compound can be formulated in a controlled release dosage form as
a tablet comprising a hydrophilic or hydrophobic matrix. A composition in
the form of a capsule can be prepared using routine encapsulation
procedures, for example by incorporation of active compound and
excipients into a hard gelatin capsule. Alternatively, a semi-solid matrix
of active compound and high molecular weight polyethylene glycol can be
prepared and filled into a hard gelatin capsule; or a solution of active
compound in polyethylene glycol or a suspension in edible oil, for
example liquid paraffin or fractionated coconut oil, can be prepared and
filled into a soft gelatin capsule. Compounds identified by the processes
described herein which are active when given parenterally can be
formulated for intramuscular or intravenous administration. A typical
composition for intramuscular administration will consist of a
suspension or solution of active ingredient in an oil, for example arachis
oil or sesame oil. A typical composition for intravenous administration
will consist of a sterile isotonic aqueous solution containing, for example,
active ingredient, dextrose, sodium chloride, a co-solvent, for example
polyethylene glycol and, optionally, a chelating agent, for example
ethylenediaminetetraacetic acid, and an anti-oxidant, for example sodium
metabisulphite. Alternatively, the solution can be freeze dried and then
reconstituted with a suitable solvent just prior to administration.
Identified compounds which are active on rectal administration can be
formulated as suppositories. A typical suppository formulation will
generally consist of active ingredient with a binding and/or lubricating
agent such as a gelatin or cocoa butter or other low melting vegetable or
synthetic wax or fat. Identified compounds which are active on topical
administration can be formulated as transdermal compositions. Such
compositions include, for example, a backing, active compound reservoir,
a control membrane, liner and contact adhesive. The typical daily dose of
a compound varies according to individual needs, the condition to be
treated and the route of administration. Suitable doses are in the
general range of from 0.001 to 10 mg/kg bodyweight of the recipient per
day.

The following examples are to illustrate the invention, but
should not be interpreted as a limitation thereon.
Examples 

Cloning, Expression and Purification of the Androgen Receptor 
Ligand-Binding Domain 

The rat androgen receptor (rAR) ligand-binding domain (LBD)
cDNA, from amino acid 646 to 901, was cloned from a rat prostate cDNA
library (Clontech) by PCR. The primers used were
CATATGATTGAAGGCTATGAATGTCAACCTATCTTT (SEQ ID NO: 3) and
TCACTGTGTGTGGAAATAGATGGG (SEQ ID NO: 4). The rat AR LBD was
expressed as a fusion protein driven by the T7 promoter of pET28b
vector (Novagen) to include an N-terminal polyhistidine tag and a
thrombin cleavage site. The replacement of T877 with A (the LNCaP
mutation) in this rAR LBD expression construct was performed with the
QuickChange Site-Directed Mutagenesis kit (STRATAGENE).
Dihydrotestosterone (DHT) was included in the E. coli (BL21-DE3)
fermentation medium at a concentration of 0.05 mM. Induction with
0.4 mM isopropyl-β-D-thiogalactopyranoside was allowed to proceed for
16 hours at 20°C in M9 minimal media supplemented with casamino
acids (Difco) and trace minerals, and pellets were stored at -70°C. A
total of 6-9 mg of recombinant AR LBD was isolated from a 15 gram cell
pellet following sonication and chromatography on a nickel-chelate resin.
Polyhistidine-tagged AR LBD of approximately 90% purity eluted at 0.45
M imidazole in a gradient of 0.05-1.0 M imidazole. This material was
quantitatively cleaved at an engineered site for thrombin recognition,
followed by chromatography on benzamidine sepharose (Pharmacia) to
remove the serine protease, with a 70% recovery. The final sample,
containing the sequence Gly-Ser-His-Met at the N-terminus followed by
residues 646-901 of the rat (664-919 in the human) AR LBD protein,
was concentrated for crystallography to 2 mg/ml in 20 mM Tris (pH 7.5),
0.5 M NaCl, 10% glycerol, 1 mM EDTA and 1 mM DTT.
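
Because the cloned rat residues 646-901 correspond to human residues 664-919, residue labels given in human numbering (such as T877) can be converted to the rat construct with a fixed offset. The sketch below assumes the correspondence is a constant shift of 18 with no insertions or deletions across the LBD, which the equal range lengths suggest but which should be confirmed against the sequence listing; the function names are hypothetical.

```python
RAT_START, RAT_END = 646, 901
HUMAN_START, HUMAN_END = 664, 919

# Assumption: a constant offset applies across the whole cloned LBD.
OFFSET = HUMAN_START - RAT_START


def human_to_rat(position):
    """Convert a human AR LBD residue number to rat numbering."""
    if not HUMAN_START <= position <= HUMAN_END:
        raise ValueError(f"{position} is outside the cloned LBD (human numbering)")
    return position - OFFSET


def rat_to_human(position):
    """Convert a rat AR LBD residue number to human numbering."""
    if not RAT_START <= position <= RAT_END:
        raise ValueError(f"{position} is outside the cloned LBD (rat numbering)")
    return position + OFFSET


print("offset:", OFFSET)
print("human T877 -> rat residue", human_to_rat(877))
```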

The sequence of the rat Androgen Receptor LBD (AR), as cloned,
is shown with the secondary structural features marked. For comparison,
the aligned sequence of the Progesterone Receptor LBD (PR) is given.
Residues involved in androgen binding are marked (*). Residues which
are disordered in the crystal structure are underlined. The AR sequence
is SEQ ID NO: 1. The PR sequence is SEQ ID NO: 2.
| H3 |

660 GSHMIEGYECQPIF[...] AR
678 GQDIQLIPPLINLLMSIEPDVIYAGHDNTKPDTSSSLLTSLNQLGE PR

| | H4/5 |

710 RQLVHVVKWAKALPGFRNLHVDDQMAVIQYSWMGLMVFAMGWRSFTNVNS AR
724 RQLLSVVKWSKSLPGFRNLHIDDQITLIQYSWMSLMVFGLGWRSYKHVSG PR

SSSS SSS |-H6| | H7 | |---H8-

760 RMLYFAPDLVFNEYRMHKSRMYSQCVRMRHLSQEFGWLQITPQEFLCMKA AR
774 QMLYFAPDLILNEQRMKESSFYSLCLTMWQIPQEFVKLQVSQEEFLCMKV PR

-| SSS | H9- | |

810 LLLFSIIPVDGLKNQKFFDELRMNYIKELDRIIACKRKNPTSCSRRFYQL AR
824 LLLLNTIPLEGLRSQTQFEEMRSSYIRELIKAIGLRQKGVVSSSQRFYQL PR

-- H10/11 --| |-| | H12 |

860 TKLLDSVQPIARELHQFTFDLLIKSHMVSVDFPEMMAEIISVQVPKILSG AR
874 TKLLDNLHDLVKQLHLYCLNTFIQSRALSVEFPEMMSEVIAAQLPKILAG PR

*

SSS

910 KVKPIYFHTQ AR
924 MVKPLLFHK PR

Crystallization 

The AR-LBD - Dihydrotestosterone (DHT) complex was
crystallized at 20°C by vapor diffusion in the hanging-drop mode. In the
crystallization trials, the protein complex as obtained from MMB&B was
used without any further purification. In the initial trial to obtain
crystallization conditions, a sparse matrix crystallization screen was
done with the Crystal Screens 1 and 2 (Hampton Research). For each
crystallization trial, a 2 µl drop was prepared by mixing 1 µl of purified
protein (1.9 mg/ml) with an equal volume of reservoir solution. The
reservoir contained 1.0 ml of the precipitating solution. Small crystals
were obtained in two days from six of the drops (Table 1).
Table 1: Crystallization Conditions

Screen/#  Precipitating Solution                                     Result
1/16      1.5 M Li Sulfate, 0.1 M Na Hepes, pH 7.5                   Small rods
1/29      0.8 M Na/K Tartrate, 0.1 M Na Hepes, pH 7.5                Larger rods
1/30      2% v/v PEG 400, 2.0 M Am Sulfate, 0.1 M Na Hepes, pH 7.5   Small cubes
2/20      1.6 M Mg Sulfate, 0.1 M MES, pH 6.5                        Small crystallites
2/32      1.6 M Am Sulfate, 0.1 M NaCl, 0.1 M Hepes, pH 7.5          Small rods
2/42      12% v/v Glycerol, 1.5 M Am Sulfate, 0.1 M Tris, pH 8.5     Small rods

The largest single crystal, measuring 0.05 mm x 0.04 mm x 0.26 mm,
was obtained from Crystal Screen 1, solution #29 (0.8 M Na/K Tartrate,
0.1 M Na Hepes, pH 7.5). This crystal was subsequently used in the
initial data collection run (as described below).

Optimization of the crystallization condition was done using a
Cyberlab C-200 automated crystallization robotic workstation. A
crystallization trial was performed using a 24-step linear gradient from
0.6 M to 1.26 M Na tartrate, 100 mM Hepes, pH 7.5 (Note: The
optimization screen used sodium rather than sodium/potassium
tartrate). The largest, rod shaped crystal, with dimensions 0.09 mm x
0.09 mm x 0.20 mm, was obtained at 0.887 M Na Tartrate. This crystal
was used in the second data collection run (as described below).
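
The 24-step gradient can be tabulated with a short sketch to see which step corresponds to the condition that gave the largest crystal. It assumes, as an illustration only, that both end points (0.6 M and 1.26 M) are included as steps and that the steps are evenly spaced; the function name is hypothetical.

```python
def linear_gradient(start, stop, steps):
    """Evenly spaced concentrations from start to stop, end points included."""
    increment = (stop - start) / (steps - 1)
    return [start + i * increment for i in range(steps)]


# Na tartrate concentrations (M) for the 24 optimization conditions.
wells = linear_gradient(0.60, 1.26, 24)

# Index of the step closest to the reported 0.887 M condition.
closest = min(range(len(wells)), key=lambda i: abs(wells[i] - 0.887))

for number, conc in enumerate(wells, start=1):
    marker = "  <- largest crystal" if number - 1 == closest else ""
    print(f"step {number:2d}: {conc:.3f} M{marker}")
```

Under these assumptions the reported 0.887 M condition falls exactly on a gradient step, which is a useful consistency check on the stated concentration.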
Data Collection and Reduction 

For the initial X-ray experiment, the crystal from the initial
crystallization screen was flash cooled by dipping it in a cryoprotectant
solution containing the precipitating solution (0.8 M Na/K Tartrate, 0.1 M
Na Hepes, pH 7.5) with 250 mM NaCl and 20% Glycerol added and then
placing it in a cold stream at 100 K.

For data set 1, X-ray diffraction data were collected with an R-Axis II
imaging plate detector. The radiation, generated by a Rigaku RU-200
rotating anode operating at 5 kW with a fine focus filament (0.3 x
3.0 mm), was monochromated (Cu Kα) and intensified by focusing with
Yale mirrors (Molecular Structure Corporation). The crystal diffracted to
better than 2.4 Å resolution. Autoindexing and processing of the
measured intensity data were carried out with the HKL software package
(Otwinowski, Z. (1993) in CCP4 Study Weekend, Data Collection and
Processing (Sawyer, L., Isaacs, N., and Bailey, S., Eds.) pp 56-62, SERC
Daresbury Laboratory, Warrington, U.K.). X-ray diffraction from the
crystal has the symmetry and systematic absences of the
orthorhombic space group P212121 with unit cell dimensions a = 56.03
Å, b = 66.27 Å, c = 70.38 Å, and one molecule per asymmetric unit
(Matthews volume = 2.16 Å³ Da⁻¹).
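
The Matthews volume quoted above can be checked from the unit cell dimensions. The sketch below assumes four asymmetric units per cell, as required by space group P212121, and uses the molecular weight of 30,245 Da given for the AR-LBD in the Background. The solvent estimate uses the common approximation 1 - 1.23/Vm and is not a value reported in this specification.

```python
def matthews_volume(a, b, c, molecules_per_cell, mol_weight_da):
    """Matthews volume (cubic angstroms per dalton) for an orthorhombic cell."""
    cell_volume = a * b * c  # all cell angles are 90 degrees
    return cell_volume / (molecules_per_cell * mol_weight_da)


# P212121 has 4 asymmetric units per cell; one molecule per asymmetric unit.
vm = matthews_volume(56.03, 66.27, 70.38, molecules_per_cell=4 * 1,
                     mol_weight_da=30245)

# Rough solvent fraction from the usual protein partial specific volume.
solvent_fraction = 1.0 - 1.23 / vm

print(f"Vm = {vm:.2f} A^3/Da, approx. solvent fraction = {solvent_fraction:.2f}")
```

The computed value rounds to 2.16 Å³ Da⁻¹, in agreement with the figure given for one molecule per asymmetric unit.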

A second X-ray diffraction data set (data set 2) was collected at
the IMCA-CAT beamline (sector 17ID) at the Advanced Photon Source
synchrotron at Argonne, IL. The crystal from the optimization screen
described above was flash-cooled by placing it in the reservoir solution
(0.887 M Na Tartrate, 0.1 M Na Hepes, pH 7.5) with 250 mM NaCl and
20% Glycerol added, and then placing it in a cold stream at 100 K. The
data were collected with a Bruker 2x2 mosaic CCD detector. The crystal
diffracted to better than 2.0 Å. Autoindexing and processing of the
measured intensity data were carried out with the HKL2000 software
package (Otwinowski, Z. (1993) in CCP4 Study Weekend, Data Collection
and Processing (Sawyer, L., Isaacs, N., and Bailey, S., Eds.) pp 56-62,
SERC Daresbury Laboratory, Warrington, U.K.). The data collection and
processing statistics for both data sets are summarized in table 2.
Structure Determination (Molecular Replacement)

The structure was determined by the method of molecular
replacement with the program AMoRe (Navaza, J. (1994) AMoRe: an
automated package for molecular replacement. Acta Cryst. A50, 157-
163). The Progesterone Receptor ligand binding domain (PR-LBD), which
has 54% sequence identity and 76% sequence homology to AR-LBD, was
used as the search model. The atomic coordinates of PR-LBD (Protein
Data Bank reference code 1A28) by Williams & Sigler (Nature 1998 393,
391) were unmodified except for the removal of the ligand and solvent
molecules. A second molecular replacement search was performed with
a theoretical model for the AR-LBD provided by the MMS/CADD group
(table 3). The PR-LBD structure gave a slightly better solution than the
AR-model (1.7σ vs. 1.3σ above background) and was used in the
subsequent refinement, although both structures gave equivalent results
with no molecular interpenetration.

Table 2: Data Collection and Processing

                              Data Set 1             Data Set II
Date                          5/19/99                6/17/99
Source/Detector               Rigaku RU-200          IMCA/APS 17ID
Detector                      R-axis II              Bruker 2x2
Wavelength                    CuKα (1.54 Å)          1.00 Å
(label illegible)             364                    400
Δφ                            0.5°                   0.5°
Crystal to plate distance     150 mm                 135 mm
Time/frame                    20 min                 1 sec
Number of Observations        209,891                416,207
Data Reduction Program        HKL                    HKL2000
Unique reflections            10,824                 18,308
Reflections Used              10,114                 16,862
Resolution                    2.4 Å (2.5-2.4 Å)      2.0 Å (2.1-2.0 Å)
Completeness                  93.8% (71.6%)          92.6% (73.0%)
Multiplicity                  (illegible)            7.3
Mosaicity                     (illegible)            0.332
Rsym (on I)                   4.2% (17.5%)           10.1% (25.6%)
Space Group                   P212121                P212121
a                             56.09 Å                56.08 Å
b                             66.43 Å                65.76 Å
c                             70.54 Å                70.51 Å
Wilson B-value                39.05 Å²               29.26 Å²

Values for data in the last resolution shell are given in parentheses.
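
As a rough cross-check on the collection geometry, Bragg's law relates the wavelength and resolution limit of each data set to the largest scattering angle recorded. The sketch below is illustrative only. It uses wavelengths of 1.54 Å and 1.00 Å, resolution limits of 2.4 Å and 2.0 Å, and crystal-to-detector distances of 150 mm and 135 mm for data sets 1 and 2, and it assumes a flat detector perpendicular to and centered on the beam, which is not stated in the table.

```python
import math


def max_two_theta_deg(wavelength_a, d_min_a):
    """Largest scattering angle 2-theta (degrees) from Bragg's law, n = 1."""
    return 2.0 * math.degrees(math.asin(wavelength_a / (2.0 * d_min_a)))


def spot_radius_mm(distance_mm, two_theta_deg):
    """Distance of a reflection from the beam center on a flat detector."""
    return distance_mm * math.tan(math.radians(two_theta_deg))


# (wavelength in angstroms, resolution limit in angstroms, distance in mm)
data_sets = {
    "data set 1": (1.54, 2.4, 150.0),
    "data set 2": (1.00, 2.0, 135.0),
}

for name, (wavelength, d_min, distance) in data_sets.items():
    two_theta = max_two_theta_deg(wavelength, d_min)
    radius = spot_radius_mm(distance, two_theta)
    print(f"{name}: 2-theta max = {two_theta:.1f} deg, "
          f"radius at detector = {radius:.0f} mm")
```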



Table 3: Molecular Replacement

Search Model:                   Progesterone           AR Model
                                (PDB file 1A28)
Program Used                    AMoRe                  AMoRe
Resolution Range                8.0-4.0 Å              8.0-4.0 Å
Radius of Integration           25 Å                   25 Å
Number of Reflections           2,393                  2,393
Number of Atoms                 2,019                  2,094
RF Correlation (2nd solution)   0.16 (0.12)            0.13 (0.11)
TF Correlation (2nd solution)   0.31 (0.20)            0.23 (0.14)
TF R-factor (2nd solution)      49.0% (52.7%)          52.1% (54.0%)
Rigid Body Correlation          0.34                   0.28
Rigid Body R-factor             48.1%                  50.4%



Structure Refinement 

The structure was first refined with the initial 2.4 Å data set (2σ
data, 9,818 reflections) by the method of simulated annealing with the
program X-PLOR (Brunger, A.T., Kuriyan, J. & Karplus, M. (1987)
"Crystallographic R-factor refinement by molecular dynamics", Science
235: 458-460) in four cycles to an R-factor of 27.7%. Each refinement
cycle consisted of a least-squares minimization, simulated annealing at
3000 K, and individual isotropic B-factor refinement. The first cycle, with
the Progesterone molecular replacement model unmodified for the
sequence differences between AR and PR, gave an R-factor of 33.8%. The
model was then rebuilt using the AR amino acid sequence and a second
refinement cycle gave an R-factor of 29.6%. At this stage of the
refinement, the DHT molecule could be clearly seen in the difference
electron density map.

After each cycle, the structure was carefully examined using the
molecular graphics program CHAIN (Sack, John S. (1988)
"CHAIN- A Crystallographic Modeling Program", J. Mol Graphics 6: 224-
225) and modifications were made to the structure as needed. Several
residues, from both the N- and C-termini of the molecule, which were not
seen in the electron density maps were removed from the model. After
the second cycle of refinement, the DHT was added to the model. Solvent
molecules were added where there were 3σ peaks in both the 2Fo - Fc
and Fo - Fc electron density maps and removed if their B-factor went
above 60 Å². After four cycles of X-PLOR refinement, a careful
examination of the electron density showed the model to be much
improved, although molecular refitting still needed to be done in some
regions. The density is clear except for some of the loop regions,
particularly the loop between helices I and II, which was also poorly
modeled in the PR structure.

Table 4: Refinement Statistics (X-PLOR)

Part I: 2σ data (9,818 reflections) to 2.4 Å
Cycle 1    251 residues    No ligand      0 waters    R = 33.8 %
Cycle 2    248 residues    No ligand      0 waters    R = 29.6 %
Cycle 3    247 residues    ligand        18 waters    R = 28.3 %
Cycle 4    246 residues    ligand        40 waters    R = 27.7 %

Part II: 2σ data (15,067 reflections) to 2.0 Å
Cycle 5    246 residues    ligand        32 waters    R = 27.9 %
Cycle 6    246 residues    ligand        57 waters    R = 26.8 %
Cycle 7    246 residues    ligand        58 waters    R = 26.7 %
Cycle 8    246 residues    ligand       106 waters    R = 24.2 %

At this stage of the refinement, the higher resolution data collected at the
APS synchrotron became available. Four additional X-PLOR refinement
cycles were performed with the 2.0 Å data set (2σ data, 15,067
reflections) following the same protocol. The final structure has an R-
factor of 24.2% with a total of 106 solvent molecules. The final
refinement statistics are presented in table 5.

Table 5: Final Refinement Parameters

Resolution Range        10.0 - 2.0 Å
Reflections             15,067
R-factor                24.2 %
R-free                  31.2 %
# residues              246 (672-917)
# atoms                 2118 (1991 protein atoms, 21 DHT, 106 waters)
RMS deviations
    bond lengths        0.014 Å
    bond angles         1.594°
    improper angles     1.558°
Average B-factors
    Protein             25.02 Å²
    DHT                 14.40 Å²
    Water               30.21 Å²
    Wilson B-factor     29.26 Å²
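
For reference, the conventional crystallographic R-factor reported above is the sum of the absolute differences between observed and calculated structure factor amplitudes divided by the sum of the observed amplitudes; R-free is the same quantity computed over reflections withheld from refinement. A minimal sketch follows. The amplitudes in the example are made-up numbers, not data from this structure, and the two lists are assumed to be on a common scale already.

```python
def r_factor(f_obs, f_calc):
    """Conventional R = sum(| |Fo| - |Fc| |) / sum(|Fo|)."""
    if len(f_obs) != len(f_calc):
        raise ValueError("observed and calculated lists must match in length")
    numerator = sum(abs(abs(fo) - abs(fc)) for fo, fc in zip(f_obs, f_calc))
    denominator = sum(abs(fo) for fo in f_obs)
    return numerator / denominator


# Hypothetical amplitudes for three reflections (illustration only).
example_f_obs = [100.0, 50.0, 25.0]
example_f_calc = [90.0, 55.0, 30.0]

print(f"R = {100.0 * r_factor(example_f_obs, example_f_calc):.1f} %")
```

Applied separately to the working set and to the withheld test set of reflections, the same function would give the R-factor and R-free, respectively.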

Description of the Molecule 

The structure of AR-LBD is complete from residues 671 through
917 for the wild-type and 672 to 918 for the LNCaP mutant. Analysis of
the structures with the program PROCHECK showed only minor exceptions
to the allowed geometry. In the wild-type structure, the first six residues
of the chain (664 - 670) are not seen in the electron density and are
probably disordered. This leaves only one residue before the initial
residue of the first α-helix (H1) in the wild-type structure, none in the
LNCaP mutant structure. On the C-terminal end, the last two residues
(918 - 919) are not seen in the electron density of the wild-type
structure, but only the last is missing in the mutant. In addition, since
the loop between helices 9 and 10 (residues 845-850) is not well defined,
it has been modeled as poly-alanine.

Folding and Packing

As expected, the AR LBD has the same overall three-dimensional
structure as those of the other nuclear hormone receptor
LBDs. The molecule is folded into a "helical sandwich" consisting of 10
α-helices. There are four small pieces of beta strand, forming two short
beta-sheets; one in the core of the molecule between helices 5 and 6 near
the ligand binding site, and the other formed by the loop between helices
8 and 9 and the C-terminus. This latter sheet, also seen in the PR LBD
structure, holds helix 12 in the closed, agonist conformation, close to
and capping the ligand binding site.
Lack of Dimer Formation

Studies have indicated that the estrogen, progesterone, and
androgen receptors all function as homodimers and that AR LBD forms
dimers in solution. Thus it could be expected that the AR LBD domains
might form homodimers in the crystal similar to those previously seen in
the RXR-α and estrogen receptor (ER) LBD crystal structures. In the PR
LBD structure, the two monomers in the asymmetric unit are related by
a dyad, but the two-fold-symmetric configuration is strikingly different
from that of the RXR and ER homodimers and the area buried in this
configuration is much smaller than would be expected for stable dimer
formation. In the AR LBD crystal, the ligand-binding domains are
unmistakably monomeric, and there are no twofold axes relating
domains. Moreover, the homodimer interaction seen in the structures of
ER and RXR LBDs is not possible for the AR LBD, as the C-terminal tail
is bound to the groove formed by helices 9 and 10, thereby obstructing
the contact region between monomers in RXR and ER homodimers.
Whether this observation reflects a non-dimeric state of the AR LBD in
the functional AR dimer or is an artifact of the conditions used for AR
LBD crystallization remains to be determined. It is noteworthy that the
ER LBD constructs used for crystallization have been truncated to
remove an analogous C-terminal extension.
Comparison with Progesterone Receptor 

While there is only 55% sequence identity between AR LBD and
PR LBD, there is a 77% sequence similarity, and as expected, the three-
dimensional structures of these two LBDs are very similar with an r.m.s.
deviation of 1.3 Å between corresponding Cα atom positions. As with PR,
AR LBD has no helix 2, but its helix 12 is longer than those of RXR or
TR. In the case of AR, while helices 10 and 11 are nearly contiguous,
there is a proline residue at position 868 that causes a kink between the
two helices.

Comparison with theoretical AR model 

The theoretical AR model obtained from MMS/CADD and the AR
structure have an r.m.s. deviation of 1.29 Å for the 247 alpha carbons.
More importantly, the hormone binding site is virtually identical with the
exception of the side chains of Met 732(749), Leu 863(880), and Leu
864(881), which are in different rotamers. This causes the binding cavity
to be more compact in the AR structure. Also, there is a flip of the side
chain of Asn 688(705) so that the ND2 atom is in position to make a
hydrogen bond to the carbonyl off of the D-ring.

Table 6: Comparison of AR-LBD to PR-LBD and Theoretical model

               Cα            Main          Side          Total
AR vs. PR      1.22 (246)    1.27 (983)    1.80 (772)    1.53 (1,755)
AR vs. CADD    1.25 (246)    1.31 (983)    2.41 (971)    1.93 (1,954)
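
The Total column of Table 6 can be reproduced, to within rounding, from the main-chain and side-chain columns, because r.m.s. deviations over disjoint atom sets combine as an atom-weighted mean of squares. The sketch below is an illustrative consistency check rather than part of the original analysis; pooled_rmsd is a hypothetical helper name, and the inputs are the rounded values from the table (r.m.s. deviation in Å, number of atoms).

```python
import math


def pooled_rmsd(groups):
    """Combine (rmsd, atom_count) pairs for disjoint atom sets into one rmsd."""
    total_atoms = sum(count for _, count in groups)
    sum_of_squares = sum(count * rmsd ** 2 for rmsd, count in groups)
    return math.sqrt(sum_of_squares / total_atoms)


# Main-chain and side-chain entries for each comparison.
ar_vs_pr = pooled_rmsd([(1.27, 983), (1.80, 772)])
ar_vs_cadd = pooled_rmsd([(1.31, 983), (2.41, 971)])

print(f"AR vs. PR   total: {ar_vs_pr:.3f} A over {983 + 772} atoms")
print(f"AR vs. CADD total: {ar_vs_cadd:.3f} A over {983 + 971} atoms")
```

Both pooled values agree with the tabulated totals of 1.53 Å and 1.93 Å to within 0.01 Å, the precision expected when starting from rounded inputs.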

Binding of Dihydrotestosterone

At the end of the molecular replacement procedure with the PR
LBD structure without progesterone as search model, the largest piece of
difference electron density, at approximately the 3σ level, was found at
the progesterone-binding site. Replacing the bound progesterone agonist
(which has an acetyl group at the 17-position) with a model of
dihydrotestosterone (DHT, which has a hydroxyl group at the 17-position)
produced an even better fit to the difference electron density, indicating
that DHT binds to AR LBD in an almost identical fashion to the way
progesterone binds to PR LBD. Both agonists interact with helices 3, 5,
and 11 of their respective LBDs. Ring A, which is identical in the two
steroids, makes similar interactions with the side chains of Q711, M745,
R752 (Q725, M759, R766 in PR LBD), and a conserved water molecule.
The interactions with ring C are also similar, with close contacts to the
mainchain of L704 (L718 in PR LBD) and sidechain of N705 (N719 in PR
LBD). The contact between C18 and the Oγ1 of T877 is unique to the
wild-type AR LBD, as the corresponding cysteinyl side chain is pointed
away from the steroid in the PR LBD structure.

Since progesterone and DHT differ in the substituent on ring D,
it is expected that interactions with respective receptors will differ in this
region. In the AR LBD structure, Nδ2 of N705 makes a hydrogen bond to
the D-ring hydroxyl of DHT. A similar interaction could be made
between progesterone and the PR LBD if there were a flip of both the
steroid acetyl group and the side chain of N719. This would place the
oxygen approximately 3.2 Å from the Nδ2 atom of Asn 719. The ligand
contact surface area is slightly larger for progesterone in PR than for
DHT in AR (483 vs. 448 Å²) but they are both considerably smaller than
the ligand contact surface area in TR (559 Å²), PPARγ (583 Å²), or the
Vitamin D receptor (677 Å²).

Figure 3 shows two orthogonal views of the omit electron density
map of dihydrotestosterone (DHT) in the hormone-binding site of AR-
LBD. There are hydrogen bonds between the steroid and the side chains
of Arg 752 and Asn 705.

Table 7: Dihydrotestosterone Contacts (3.4 Å)

Hydrogen Bonds
O3     Arg 752 Nη2     2.89 Å (2.77 Å)
O3     Gln 711 Nε2     3.36 Å (3.20 Å)
O20    Asn 705 Nδ2     2.80 Å (3.20 Å)
O20    Thr 877 Oγ1     2.70 Å (N/A)

Possible Close Contacts
C11    Leu 704 O       3.31 Å
C12    Asn 705 Nδ2     3.07 Å
C17    Asn 705 Nδ2     3.34 Å
C19    Met 745 Sδ      3.38 Å
C18    Thr 877 Oγ1     3.07 Å



Comparison with Progesterone binding 

Comparison of the structure of DHT in the AR-LBD with the
structure of progesterone in the PR-LBD (Williams, S.P. & Sigler, P.B.
(1998) "Atomic Structure of Progesterone Complexed with its Receptor",
Nature 393, 391) shows a similar mode of binding. Ring A, which is
identical in the two steroids, makes similar interactions with the side
chains of Q711, M745, R752 and a conserved water molecule
(table 8). The interactions with ring C are also similar, with close
contacts to the mainchain of L704 and sidechain of N705. The contact
from C18 to the Oγ1 of T877 is unique to AR-LBD, as the corresponding
cysteine sidechain is pointed away from the steroid in the PR-LBD
structure.

Since progesterone and DHT differ in the substitution off of ring
D, it is expected that there will be different interactions with the protein
in this region. In the AR structure, the Nδ2 atom of Asn 705 makes a
hydrogen bond to the D-ring hydroxyl.

A similar interaction could be made in the PR if there were a flip
of both the steroid acetyl group and the side chain of N719. This
would place the carbonyl oxygen approximately 3.2 Å from the Nδ2 atom
of Asn 719. In AR-LBD, there is also a close contact to the side chain of
T877 which is absent in the PR-LBD structure.

Figure 4 shows a comparison of AR and PR steroid binding: the
binding of dihydrotestosterone to AR-LBD (top) and of
progesterone to PR-LBD (bottom). Note that an additional hydrogen bond
interaction would be possible if the sidechains of both N719 and the
progesterone were flipped.

Table 8: Comparison of AR and PR steroid binding

            AR                                    PR

Ring A
O3:         H-bond to R752 NH2 (2.9 Å)            H-bond to R766 NH2 (2.8 Å)
            H-bond to water (3.5 Å)               H-bond to water (3.1 / 3.4 Å)
            SC of Q711 in different rotamer;      Contact to SC of Gln 725;
            distance to O3 is 3.4 and 4.13 Å      distance to O3 is 3.2 and 3.3 Å
C19         Contact to M745 SD (3.4 Å)            Similar orientation (3.5 Å)
C2:         SC of Q711 (3.5 Å)                    different rotamer (3.2 & 3.3);
                                                  distance to C4 is 4.1 Å

Ring C
C11         L704 O (3.3 Å)                        (3.5 Å)
C12         Contact to N705 Nδ2 (3.1 Å)           Contact to N719 Oδ1 (3.4 Å)
C18         Contact T877 Oγ1 (3.1 Å)              SC of C891 pointing away;
                                                  distance to Sγ is 3.8 Å

Ring D
O20/C21     O21 in AR is close to C21 in PR (Possible flip of Carboxyl in PR?)
            N/A                                   O20: Contact to C891 Cα (3.2 Å)
            O20: H-bond N705 Nδ2 (2.8 Å)          C21: Contact to N719 OD1 (3.2 Å)
            O20: Contact T877 Oγ1 (2.7 Å)         SC of C891 pointing away
C17         Contact N705 Nδ2 (3.3 Å)              Ring in slightly different orientation;
                                                  distance to N719 Oδ1 is 4.7 Å



Structure of the Complex of DHT with the LBD of the LNCaP Mutant

In the LNCaP mutant, T877 is replaced by an alanine residue.
The mutant LBD structure has an r.m.s. deviation of 0.8 Å compared to
the wild-type structure, close to the expected r.m.s. deviation due to the
estimated errors in the coordinates. In particular, the binding of DHT is
essentially identical by wild-type and mutant LBDs except at the point of
mutation. Here the replacement of T877 by alanine leaves additional
space off the D-ring of DHT to accommodate a larger substituent on
position 17. This may explain the promiscuous ability of the LNCaP
mutant, unlike wild-type AR, to bind to a variety of other hormones and
analogs like some progestins, estrogens and cortisols that differ from
DHT in substitution at position 17. For example, the binding of
flutamide, estradiol, and progesterone to the LNCaP mutant can activate
the mutant receptor. Conversely, mutation of T877 to residues with
larger sidechains such as aspartic acid and lysine would be expected
to completely preclude the binding of ligands with any substituent at
position 17 of the D-ring and such mutations have been shown to totally
eliminate androgen binding.



WO 01/27622 



PCT/US00/28495 





Table A

ATOM      1  CB   ILE   672      14.846  25.527  23.734  1.00  25.78
ATOM      2  CG2  ILE   672      16.247  25.008  24.099  1.00  25.56
ATOM      3  CG1  ILE   672      14.842  27.035  23.978  1.00  25.60
ATOM      4  CD1  ILE   672      15.312  27.404  25.360  1.00  25.81
ATOM      5  C    ILE   672      15.115  23.900  21.789  1.00  25.32
ATOM      6  O    ILE   672      16.189  23.926  21.195  1.00  24.67
ATOM      7  N    ILE   672      13.004  25.282  22.008  1.00  24.75
ATOM      8  CA   ILE   672      14.475  25.215  22.242  1.00  25.11
ATOM      9  N    PHE   673      14.448  22.768  22.030  1.00  25.89
ATOM     10  CA   PHE   673      14.980  21.446  21.635  1.00  25.86
ATOM     11  CB   PHE   673      14.020  20.306  22.029  1.00  26.22
ATOM     12  CG   PHE   673      14.557  18.923  21.722  1.00  25.12
ATOM     13  CD1  PHE   673      15.765  18.501  22.251  1.00  25.16
ATOM     14  CD2  PHE   673      13.877  18.066  20.874  1.00  25.81
ATOM     15  CE1  PHE   673      16.286  17.255  21.946  1.00  23.42
ATOM     16  CE2  PHE   673      14.403  16.809  20.567  1.00  25.08
ATOM     17  CZ   PHE   673      15.609  16.417  21.107  1.00  23.85
ATOM     18  C    PHE   673      15.213  21.374  20.147  1.00  25.25
ATOM     19  O    PHE   673      16.260  20.926  19.680  1.00  24.38
ATOM     20  N    LEU   674      14.193  21.792  19.412  1.00  25.01
ATOM     21  CA   LEU   674      14.237  21.802  17.969  1.00  25.58
ATOM     22  CB   LEU   674      12.833  21.974  17.391  1.00  26.05
ATOM     23  CG   LEU   674      12.067  20.653  17.317  1.00  26.55
ATOM     24  CD1  LEU   674      10.617  20.887  16.935  1.00  26.35
ATOM     25  CD2  LEU   674      12.???  19.758  16.304  1.00  26.09
ATOM     26  C    LEU   674      15.199  22.801  17.357  1.00  25.10
ATOM     27  O    LEU   674      15.743  22.518  16.294  1.00  26.08
ATOM     28  N    ASN   675      15.440  23.939  18.019  1.00  24.63
ATOM     29  CA   ASN   675      16.356  24.964  17.484  1.00  23.19
ATOM     30  CB   ASN   675      16.478  26.215  18.393  1.00  24.20
ATOM     31  CG   ASN   675      15.206  27.067  18.452  1.00  24.32
ATOM     32  OD1  ASN   675      14.368  27.062  17.547  1.00  24.82
ATOM     33  ND2  ASN   675      15.076  27.817  ??.???  1.00  24.74
ATOM     34  C    ASN   675      17.726  24.338  17.397  1.00  21.66
ATOM     35  O    ASN   675      18.435  24.524  ??.417  1.00  21.43
ATOM     36  N    VAL   676      18.095  23.612  ??.???  1.00  21.17
ATOM     37  CA   VAL   676      19.394  22.952  ??.507  1.00  20.92
ATOM     38  CB   VAL   676      19.718  22.442  ??.???  1.00  21.33
ATOM     39  CG1  VAL   676      18.899  21.237  ??.947  1.00  24.09
ATOM     40  CG2  VAL   676      21.192  22.095  ??.065  1.00  21.88
ATOM     41  C    VAL   676      19.501  21.830  17.47?  1.00  19.78
ATOM     42  O    VAL   676      20.421  21.827  ??.???  1.00  19.99
ATOM     43  N    LEU   677      18.530  20.923  17.???  1.00  19.08
ATOM     44  CA   LEU   677      18.601  19.848  ??.???  1.00  17.91
ATOM     45  CB   LEU   677      17.383  18.921  16.518  1.00  17.50
ATOM     46  CG   LEU   677      17.267  18.083  17.798  1.00  16.78
ATOM     47  CD1  LEU   677      16.355  16.934  17.541  1.00  17.01
ATOM     48  CD2  LEU   677      18.615  17.555  18.225  1.00  17.10
ATOM     49  C    LEU   677      18.768  20.427  15.068  1.00  16.96
ATOM     50  O    LEU   677      19.640  20.008  14.347  1.00  14.94
ATOM     51  N    GLU   678      17.980  21.445  14.736  1.00  19.12
ATOM     52  CA   GLU   678      18.058  22.121  13.437  1.00  20.06
ATOM     53  CB   GLU   678      16.972  23.188  13.317  1.00  23.33
ATOM     54  CG   GLU   678      15.532  22.646  13.381  1.00  28.64
ATOM     55  CD   GLU   678      14.459  23.736  13.387  1.00  32.31
ATOM     56  OE1  GLU   678      14.811  24.943  13.374  1.00  34.41
ATOM     57  OE2  GLU   678      13.253  23.384  13.410  1.00  34.91
ATOM     58  C    GLU   678      19.410  22.783  13.243  1.00  19.33






ATOM     59  O    GLU   678      19.966  22.737  12.152  1.00  18.20
ATOM     60  N    ALA   679      19.966  23.324  14.329  1.00  19.45
ATOM     61  CA   ALA   679      21.257  24.018  14.303  1.00  18.84
ATOM     62  CB   ALA   679      21.388  24.919  15.517  1.00  17.67
ATOM     63  C    ALA   679      22.472  23.094  14.195  1.00  19.25
ATOM     64  O    ALA   679      23.479  23.436  13.558  1.00  19.27
ATOM     65  N    ILE   680      22.395  21.914  14.802  1.00  18.82
ATOM     66  CA   ILE   680      23.518  20.984  14.742  1.00  17.49
ATOM     67  CB   ILE   680      23.674  20.231  16.056  1.00  17.05
ATOM     68  CG2  ILE   680      24.022  21.213  17.158  1.00  16.83
ATOM     69  CG1  ILE   680      22.393  19.467  16.391  1.00  15.55
ATOM     70  CD1  ILE   680      22.558  18.575  17.552  1.00  13.80
ATOM     71  C    ILE   680      23.516  19.984  13.593  1.00  16.89
ATOM     72  O    ILE   680      24.518  19.303  13.370  1.00  17.12
ATOM     73  N    GLU   681      22.415  19.922  12.847  1.00  16.79
ATOM     74  CA   GLU   681      22.265  19.002  11.719  1.00  17.26
ATOM     75  CB   GLU   681      20.902  19.227  11.094  1.00  16.72
ATOM     76  CG   GLU   681      20.579  18.300   9.952  1.00  18.48
ATOM     77  CD   GLU   681      20.473  16.823  10.348  1.00  17.51
ATOM     78  OE1  GLU   681      20.659  16.502  11.524  1.00  17.58
ATOM     79  OE2  GLU   681      20.214  15.981   9.467  1.00  18.59
ATOM     80  C    GLU   681      23.370  19.128  10.673  1.00  18.79
ATOM     81  O    GLU   681      23.517  20.173  10.043  1.00  20.09
ATOM     82  N    PRO   682      24.145  18.044  10.437  1.00  19.02
ATOM     83  CD   PRO   682      23.969  16.704  11.019  1.00  18.22
ATOM     84  CA   PRO   682      25.252  18.021   9.472  1.00  19.11
ATOM     85  CB   PRO   682      25.681  16.546   9.493  1.00  18.30
ATOM     86  CG   PRO   682      25.338  16.109  10.846  1.00  17.08
ATOM     87  C    PRO   682      24.912  18.475   8.057  1.00  19.76
ATOM     88  O    PRO   682      23.771  18.382   7.625  1.00  21.13
ATOM     89  N    GLY   683      25.901  18.995   7.339  1.00  20.64
ATOM     90  CA   GLY   683      25.665  19.422   5.972  1.00  21.67
ATOM     91  C    GLY   683      25.809  18.260   4.990  1.00  23.13
ATOM     92  O    GLY   683      25.595  17.108   5.355  1.00  23.47
ATOM     93  N    VAL   684      26.190  18.567   3.748  1.00  23.58
ATOM     94  CA   VAL   684      26.365  17.573   2.685  1.00  22.44
ATOM     95  CB   VAL   684      26.320  18.216   1.259  1.00  24.93
ATOM     96  CG1  VAL   684      26.217  17.130   0.183  1.00  24.57
ATOM     97  CG2  VAL   684      25.153  19.228   1.131  1.00  24.89
ATOM     98  C    VAL   684      27.725  16.934   2.811  1.00  20.64
ATOM     99  O    VAL   684      28.708  17.614   3.042  1.00  19.82
ATOM    100  N    VAL   685      27.778  15.631   2.585  1.00  19.05
ATOM    101  CA   VAL   685      29.012  14.878   2.665  1.00  17.89
ATOM    102  CB   VAL   685      28.955  13.857   3.867  1.00  17.81
ATOM    103  CG1  VAL   685      30.303  13.189   4.086  1.00  15.58
ATOM    104  CG2  VAL   685      28.527  14.556   5.147  1.00  16.27
ATOM    105  C    VAL   685      29.143  14.112   1.345  1.00  17.88
ATOM    106  O    VAL   685      28.238  13.367   0.969  1.00  18.33
ATOM    107  N    CYS   686      30.224  14.339   0.609  1.00  17.00
ATOM    108  CA   CYS   686      30.451  13.628  -0.650  1.00  17.52
ATOM    109  CB   CYS   686      31.101  14.534  -1.706  1.00  17.76
ATOM    110  SG   CYS   686      30.166  16.031  -2.147  1.00  21.38
ATOM    111  C    CYS   686      31.354  12.447  -0.327  1.00  16.97
ATOM    112  O    CYS   686      32.141  12.496   0.615  1.00  17.15
ATOM    113  N    ALA   687      31.183  11.360  -1.065  1.00  17.74
ATOM    114  CA   ALA   687      31.949  10.132  -0.836  1.00  17.57
ATOM    115  CB   ALA   687      31.161   8.929  -1.295  1.00  16.91
ATOM    116  C    ALA   687      33.277  10.161  -1.526  1.00  18.06
ATOM    117  O    ALA   687      34.185   9.431  -1.139  1.00  17.98
ATOM    118  N    GLY   688      33.370  11.023  -2.539  1.00  18.50
ATOM    119  CA   GLY   688      34.580  11.167  -3.326  1.00  19.16
ATOM    120  C    GLY   688      34.705  10.099  -4.388  1.00  19.90



WO 01/27622 



PCT/US00/28495






ATOM 121 O GLY 688
ATOM 122 N HIS 689
ATOM 123 CA HIS 689
ATOM 124 CB HIS 689
ATOM 125 CG HIS 689
ATOM 126 CD2 HIS 689
ATOM 127 ND1 HIS 689
ATOM 128 CE1 HIS 689
ATOM 129 NE2 HIS 689
ATOM 130 C HIS 689
ATOM 131 O HIS 689
ATOM 132 N ASP 690
ATOM 133 CA ASP 690
ATOM 134 CB ASP 690
ATOM 135 CG ASP 690
ATOM 136 OD1 ASP 690
ATOM 137 OD2 ASP 690
ATOM 138 C ASP 690
ATOM 139 O ASP 690
ATOM 140 N ASN 691
ATOM 141 CA ASN 691
ATOM 142 CB ASN 691
ATOM 143 CG ASN 691
ATOM 144 OD1 ASN 691
ATOM 145 ND2 ASN 691
ATOM 146 C ASN 691
ATOM 147 O ASN 691
ATOM 148 N ALA 692
ATOM 149 CA ALA 692
ATOM 150 CB ALA 692
ATOM 151 C ALA 692
ATOM 152 O ALA 692
ATOM 153 N GLN 693
ATOM 154 CA GLN 693
ATOM 155 CB GLN 693
ATOM 156 CG GLN 693
ATOM 157 CD GLN 693
ATOM 158 OE1 GLN 693
ATOM 159 NE2 GLN 693
ATOM 160 C GLN 693
ATOM 161 O GLN 693
ATOM 162 N PRO 694
ATOM 163 CD PRO 694
ATOM 164 CA PRO 694
ATOM 165 CB PRO 694
ATOM 166 CG PRO 694
ATOM 167 C PRO 694
ATOM 168 O PRO 694
ATOM 169 N ASP 695
ATOM 170 CA ASP 695
ATOM 171 CB ASP 695
ATOM 172 CG ASP 695
ATOM 173 OD1 ASP 695
ATOM 174 OD2 ASP 695
ATOM 175 C ASP 695
ATOM 176 O ASP 695
ATOM 177 N SER 696
ATOM 178 CA SER 696
ATOM 179 CB SER 696
ATOM 180 OG SER 696
ATOM 181 C SER 696
ATOM 182 O SER 696




35.802 

33.582 

33.577 

32.195 

32.046 

32.782 

31.040 

31.166 

32.219 

33.923 

33.511 

34.719 

35.017 

36.330 

36.696 

37.868 

35.819 

33.872 

33.701 

33.065 

31.933 

30.725 

30.079 

29.187 

30.547 

32.284 

31.419 

33.565 

33.995 

35.148 

34.425 

34.414 

34.757 

35.200 

36.131 

37.538 

38.420 

39.378 

38.115 

33.988 

32.997 

34 .055 

35.138 

32.970 

33.571 

34 .432 

32.575 

33.411 

31.289 

30.776 

29.251 

28.660 

27.532 

29.329 

31.318 

31.237 

32.025 

32.577 

34 .064 

34.854 

32.340 

32.275 



9.730 
9.630 
8.576 
7.917 
6.775 
5.656 
6.724 
5.627 
4.960 
9.063 
10.145 
8.296 
8.691 
8.096 



-4 .771 
-4.907 
-5.912 
-5.900 
-6.857 
-7.033 
-7.796 
-8.516 
-8.074 
-7.328 
-7.731 
-8.073 
-9.447 
-9.963 



8.618 -11.361 
8.497 -11.764 
9.170 -12.061 
8.164 -10.286 
6.952 -10.409 
9.067 -10.832 
8.660 -11.655 
9.562 -11.416 
10.074 
-9.930 
-9.069 
13.136 
■13.999 
8.471 -13.434 
8.365 -14.819 
15.103 
15.027 
6.406 -16.139 
6.241 -13.928 
4.849 -13.942 
12.745 
13.110 
13.902 
13.363 
5.193 -15.186 
3.939 -13.854 
4.298 -13.217 
14 .435 
15.375 
14 .489 
15.265 
16.234 
13.109 
12.204 
022 -12.958 
534 -11.698 
11. 694 



9.313 
8.474 
10.024 
8.608 
8.733 



9.328 
6.913 



4.577 
4 .029 
5.017 
5.587 



2.743 
2.286 
1.762 
0.601 
1.271 
1.304 
.198 



.518 



-1.707 
-1.052 
-2.333 
-2.383 
-1.589 
-2.445 
-1.418 



00 20.86 
00 20.92 
00 22.43 
00 22.00 
00 22.28 
00 22.64 
1.00 22.44 
1.00 23.43 
1.00 23.78 
1.00 24.19 
1.00 24.06 
00 26.27 
00 28.86 
00 28.93 
00 30.03 
00 31.23 
00 29.72 
00 30.15 
00 30.4 6 
1.00 32.35 
1.00 33.60 



1 

1. 

0. 
0 

1.901 -11.608 
2.100 -12.089 
2.794 -11.057 
-0.868 -11.524 



00 32.74 
00 32.95 
00 32.13 
00 33.57 
00 35.13 
,00 36.66 
.00 36.15 
.00 37.44 
.00 36.71 
.00 38.39 
.00 39.10 
.00 39.29 
.00 40.00 
.00 41.81 
.00 44.34 



1.00 45.44 
1.00 45.95 



.00 45.47 
.00 39.48 



1.00 40.11 

1.00 38.78 

1.00 38.88 

1.00 36.98 



00 37.17 
00 38.48 
00 35.56 
00 35.44 
00 34 .27 
00 32.38 
00 29.77 
00 28.80 
1.00 27.09 
1.00 28.72 
1.00 32.55 



-12.429 
-10.424 
-10.077 
-10.425 
-9.567 
-8.577 
-7 .885 



.00 33.50 
.00 31.71 
.00 30.42 
.00 30.43 
.00 31.47 
.00 30.15 



1.00 30.10 









ATOM 183 N PHE 697 32.104 -3.669 -8.099 1.00 28.48
ATOM 184 CA PHE 697 31.890 -3.933 -6.679 1.00 26.74
ATOM 185 CB PHE 697 31.982 -5.442 -6.423 1.00 25.46
ATOM 186 CG PHE 697 31.781 -5.827 -4.989 1.00 24.31
ATOM 187 CD1 PHE 697 30.536 -5.722 -4.398 1.00 24.48
ATOM 188 CD2 PHE 697 32.845 -6.281 -4.220 1.00 24.78
ATOM 189 CE1 PHE 697 30.344 -6.063 -3.071 1.00 24.65
ATOM 190 CE2 PHE 697 32.659 -6.626 -2.886 1.00 24.77
ATOM 191 CZ PHE 697 31.406 -6.512 -2.315 1.00 24.30
ATOM 192 C PHE 697 32.956 -3.205 -5.846 1.00 26.76
ATOM 193 O PHE 697 32.641 -2.393 -4.972 1.00 27.12
ATOM 194 N ALA 698 34.219 -3.495 -6.140 1.00 25.86
ATOM 195 CA ALA 698 35.351 -2.911 -5.436 1.00 24.97
ATOM 196 CB ALA 698 36.596 -3.305 -6.131 1.00 25.52
ATOM 197 C ALA 698 35.323 -1.402 -5.300 1.00 25.24
ATOM 198 O ALA 698 35.559 -0.852 -4.216 1.00 24.99
ATOM 199 N ALA 699 35.029 -0.737 -6.414 1.00 25.11
ATOM 200 CA ALA 699 35.001 0.717 -6.490 1.00 23.76
ATOM 201 CB ALA 699 34.845 1.156 -7.943 1.00 24.31
ATOM 202 C ALA 699 33.873 1.281 -5.668 1.00 22.91
ATOM 203 O ALA 699 34.084 2.133 -4.795 1.00 22.51
ATOM 204 N LEU 700 32.682 0.770 -5.957 1.00 21.56
ATOM 205 CA LEU 700 31.440 1.185 -5.314 1.00 20.84
ATOM 206 CB LEU 700 30.274 0.397 -5.937 1.00 19.59
ATOM 207 CG LEU 700 29.249 0.984 -6.911 1.00 18.78
ATOM 208 CD1 LEU 700 29.727 2.269 -7.529 1.00 18.69
ATOM 209 CD2 LEU 700 28.952 -0.015 -7.957 1.00 17.10
ATOM 210 C LEU 700 31.456 0.977 -3.793 1.00 20.77
ATOM 211 O LEU 700 30.891 1.765 -3.035 1.00 19.67
ATOM 212 N LEU 701 32.103 -0.093 -3.350 1.00 20.77
ATOM 213 CA LEU 701 32.147 -0.367 -1.941 1.00 20.58
ATOM 214 CB LEU 701 32.099 -1.871 -1.670 1.00 19.52
ATOM 215 CG LEU 701 30.582 -2.050 -1.567 1.00 19.40
ATOM 216 CD1 LEU 701 30.046 -2.911 -2.642 1.00 17.73
ATOM 217 CD2 LEU 701 30.173 -2.510 -0.217 1.00 17.25
ATOM 218 C LEU 701 33.261 0.365 -1.241 1.00 20.86
ATOM 219 O LEU 701 33.126 0.734 -0.088 1.00 21.69
ATOM 220 N SER 702 34.356 0.615 -1.937 1.00 21.06
ATOM 221 CA SER 702 35.406 1.378 -1.316 1.00 20.96
ATOM 222 CB SER 702 36.632 1.400 -2.190 1.00 21.52
ATOM 223 OG SER 702 37.204 0.120 -2.175 1.00 23.71
ATOM 224 C SER 702 34.874 2.791 -1.105 1.00 20.81
ATOM 225 O SER 702 35.187 3.423 -0.103 1.00 20.82
ATOM 226 N SER 703 34.023 3.250 -2.028 1.00 20.55
ATOM 227 CA SER 703 33.443 4.585 -1.934 1.00 18.91
ATOM 228 CB SER 703 32.755 4.966 -3.224 1.00 18.44
ATOM 229 OG SER 703 33.748 5.182 -4.194 1.00 20.63
ATOM 230 C SER 703 32.470 4.678 -0.793 1.00 18.35
ATOM 231 O SER 703 32.520 5.625 -0.025 1.00 18.89
ATOM 232 N LEU 704 31.596 3.684 -0.662 1.00 17.26
ATOM 233 CA LEU 704 30.639 3.687 0.432 1.00 16.37
ATOM 234 CB LEU 704 29.691 2.497 0.342 1.00 15.19
ATOM 235 CG LEU 704 28.558 2.583 -0.660 1.00 14.03
ATOM 236 CD1 LEU 704 27.882 1.259 -0.748 1.00 12.28
ATOM 237 CD2 LEU 704 27.582 3.681 -0.235 1.00 14.48
ATOM 238 C LEU 704 31.366 3.678 1.761 1.00 16.10
ATOM 239 O LEU 704 30.925 4.340 2.696 1.00 16.81
ATOM 240 N ASN 705 32.495 2.961 1.829 1.00 16.70
ATOM 241 CA ASN 705 33.307 2.863 3.049 1.00 16.66
ATOM 242 CB ASN 705 34.398 1.794 2.924 1.00 15.46
ATOM 243 CG ASN 705 33.850 0.384 2.941 1.00 16.24
ATOM 244 OD1 ASN 705 34.448 -0.512 2.385 1.00 16.82



ATOM 245 ND2 ASN
ATOM 246 C ASN
ATOM 247 O ASN
ATOM 248 N GLU
ATOM 249 CA GLU
ATOM 250 CB GLU
ATOM 251 CG GLU
ATOM 252 CD GLU
ATOM 253 OE1 GLU
ATOM 254 OE2 GLU
ATOM 255 C GLU
ATOM 256 O GLU
ATOM 257 N LEU
ATOM 258 CA LEU
ATOM 259 CB LEU
ATOM 260 CG LEU
ATOM 261 CD1 LEU
ATOM 262 CD2 LEU
ATOM 263 C LEU
ATOM 264 O LEU
ATOM 265 N GLY
ATOM 266 CA GLY
ATOM 267 C GLY
ATOM 268 O GLY
ATOM 269 N GLU
ATOM 270 CA GLU
ATOM 271 CB GLU
ATOM 272 CG GLU
ATOM 273 CD GLU
ATOM 274 OE1 GLU
ATOM 275 OE2 GLU
ATOM 276 C GLU
ATOM 277 O GLU
ATOM 278 N ARG
ATOM 279 CA ARG
ATOM 280 CB ARG
ATOM 281 CG ARG
ATOM 282 CD ARG
ATOM 283 NE ARG
ATOM 284 CZ ARG
ATOM 285 NH1 ARG
ATOM 286 NH2 ARG
ATOM 287 C ARG
ATOM 288 O ARG
ATOM 289 N GLN
ATOM 290 CA GLN
ATOM 291 CB GLN
ATOM 292 CG GLN
ATOM 293 CD GLN
ATOM 294 OE1 GLN
ATOM 295 NE2 GLN
ATOM 296 C GLN
ATOM 297 O GLN
ATOM 298 N LEU
ATOM 299 CA LEU
ATOM 300 CB LEU
ATOM 301 CG LEU
ATOM 302 CD1 LEU
ATOM 303 CD2 LEU
ATOM 304 C LEU
ATOM 305 O LEU
ATOM 306 N VAL













70S 


32 . 


726 


0. 


180 


7 0S 


33 . 


955 


4 . 


201 


705 


33 . 


970 


4 . 


587 


706 


34 . 


512 


4 . 


882 


706 


35 . 


151 


6. 


193 


706 


35 . 


739 


6. 


668 


706 


36. 


394 


8. 


029 


706 


37 . 


488 


8. 


146 


706 


37 . 


,586 


9, 


225 


706 


38. 


246 


7 . 


175 


706 


34. 


,089 


7 . 


180 


706 


34, 


,313 


8 . 


023 


707 


32. 


,927 


7 . 


076 


707 


31. 


,803 


7 . 


916 


707 


30. 604 


7 . 


579 


707 


29. 


, 318 


8 . 


262 


707 


29 . 


, 537 


9 , 


745 


707 


28 , 


.252 


7 . 


889 


707 


31. 


.461 


7 . 


634 


707 


31 . 


. 121 


8 . 


557 


70ft 
/wo 


31 . 


. 532 


6 , 


358 


70ft 


31 , 


,230 


5 . 


976 


70ft 


32 , 


,213 


6 . 


620 


70ft 


31 « 


. 849 


7 . 


061 


709 


33 . 


.468 


6 . 


687 


70Q 


34 


. 525 


7 . 


, 279 


70Q 


35 , 


. 874 


7 , 


, 046 


709 


37 


. 051 


7 , 


, 547 


70Q 


37 


. 573 


6 , 


, 514 


70ft 


36 


.766 


5 . 


, 660 


709 


38 


. 784 


6. 


, 544 


709 


34 


. 334 


8 . 


, 775 


709 


34 


. 628 


9, 


.317 


710 


33 


.84 5 


9, 


. 427 


710 


33 


. 616 


10 . 


.869 


710 


33 


. 459 


11 . 


. 346 


710 


34 


. 659 


11 . 


. 098 


71 0 


34 


. 329 


11 , 


.498 


710 


35 


.512 


1 1 , 


. 535 


710 


35 


. 587 


12 , 


.246 


710 


34 


.550 


12 


. 975 


710 


36 


.691 


12 


. 242 


7 1 0 


32 


.376 


1 1 , 


. 230 


710 


32 


.379 


12 


. 156 


71 1 


31 


.291 


10 


. 516 


71 1 


30 


.067 


10 - 


. 745 


71 1 


28 


.908 


9 , 


. 938 


711 


28 


.377 


10 , 


. 566 


711 


27 


.058 


10 , 


.010 


711 


26 


.758 


9 ( 


. 932 


711 
'ii 


26 


.228 


9 


. 677 


711 


30 


.209 


10 


. 4 94 


711 


29 


.564 


1 1 


. 183 


719 

' 1 £ 


31 


.043 


9 


. 529 


719 
fit 


31 


.259 


9 


. 227 


719 


32 


.163 


g 


. 008 


712 


32 


.522 


7 


. 607 


712 


31 


.288 


7 


.641 


712 


33 


. 132 


6 


.223 


712 


31 


.876 


10 


.428 


712 


31 


.507 


10 


.743 


713 


32 


.809 


11 


.099 



3. 


592 


1. 


00 


16. 


07 


3. 


410 


1. 


00 


17. 


17 


4. 


570 


1. 


00 


17. 


46 


2. 


415 


1. 


00 


17. 


04 


2. 


598 


1 . 


00 


17. 


55 


1. 


258 


1. 


00 


18. 


93 


1. 


282 


1. 


00 


21. 


19 


2. 


347 


1. 


00 


23. 


68 


2. 


978 


1. 


00 


25. 


14 


2. 


569 


1. 


00 


24. 


37 


3. 


069 


1. 


00 


16. 


10 


3. 


950 


1. 


00 


16.62 


2. 


445 


1. 


00 


15. 


13 


2. 


792 


1. 


00 


14. 


21 


1. 


925 


1 . 


00 


12. 


85 


2. 


328 


1 . 


00 


12. 


03 


2. 


280 


1. 


00 


13. 


09 


1. 


374 


1. 


00 


12. 


74 


4. 


228 


1. 


00 


14. 


91 


4. 


980 


1. 


00 


15. 


85 


4 . 


,602 


1. 


,00 


13. 


93 


5. 


,965 


1. 


.00 


12. 


96 


6. 


,917 


1. 


.00 


13. 


13 


7. 


.987 


1. 


.00 


13. 


55 


6. 


.514 


1. 


.00 


14 . 


14 


7. 


.322 


1. 


.00 


15. 


83 


6, 


. 658 


1 , 


.00 


16. 


,73 


7, 


.446 


1. 


.00 


18. 


.68 


8, 


.401 


1 , 


.00 


21. 


.63 


8. 


.826 


1, 


.00 


23. 


.39 


8, 


.723 


1. 


.00 


23. 


. 17 


7 


.486 


1 


.00 


16. 


.65 


8 


.563 


1 


.00 


17. 


.59 


6 


.428 


1 


.00 


16. 


.70 


6, 


.418 


1 


.00 


17. 


.32 


4 


.990 


1 


.00 


16. 


.07 


4 


.137 


1 


.00 


16. 


. 18 


2 


.706 


1 


.00 


16, 


.39 


1 


.850 


1 


.00 


15. 


.28 


0 


.733 


1 


.00 


15. 


.30 


0 


.3f>7 


1 


.00 


14 , 


.96 


0 


.001 


1 


.00 


14 . 


.89 


7 


.218 


1 


.00 


17. 


.85 


8 


.034 


1 


. 00 


17. 


.75 


6 


.955 


1 


. 00 


18, 


.71 


7 


.697 


1 


. 00 


19. 


.38 


7 


.127 


1 


.00 


19, 


.79 


5 


.878 


1 


.00 


22 


.36 


5 


. 446 


1 


.00 


23 


.37 


4 


. 244 


1 


.00 


25 


.35 


6 


.410 


1 


.00 


24 


.52 


9 


. 188 


1 


.00 


19 


.48 


9 


.985 


1 


.00 


19 


.57 


9 


.571 


1 


.00 


18 


.76 


10 


. 984 


1 


.00 


19 


.20 


11 


. 157 


1 


.00 


20 


.55 


12 


.590 


1 


.00 


21 


.95 


13 


.484 


1 


.00 


23 


.43 


12 


.586 


1 


.00 


22 


.57 


11 


.704 


1 


.00 


19 


.23 


12 


.834 


1 


.00 


17 


.65 


11 


.039 


1 


.00 


19 


.70 






ATOM 307 CA VAL 713
ATOM 308 CB VAL 713
ATOM 309 CG1 VAL 713
ATOM 310 CG2 VAL 713
ATOM 311 C VAL 713
ATOM 312 O VAL 713
ATOM 313 N HIS 714
ATOM 314 CA HIS 714
ATOM 315 CB HIS 714
ATOM 316 CG HIS 714
ATOM 317 CD2 HIS 714
ATOM 318 ND1 HIS 714
ATOM 319 CE1 HIS 714
ATOM 320 NE2 HIS 714
ATOM 321 C HIS 714
ATOM 322 O HIS 714
ATOM 323 N VAL 715
ATOM 324 CA VAL 715
ATOM 325 CB VAL 715
ATOM 326 CG1 VAL 715
ATOM 327 CG2 VAL 715
ATOM 328 C VAL 715
ATOM 329 O VAL 715
ATOM 330 N VAL 716
ATOM 331 CA VAL 716
ATOM 332 CB VAL 716
ATOM 333 CG1 VAL 716
ATOM 334 CG2 VAL 716
ATOM 335 C VAL 716
ATOM 336 O VAL 716
ATOM 337 N LYS 717
ATOM 338 CA LYS 717
ATOM 339 CB LYS 717
ATOM 340 CG LYS 717
ATOM 341 CD LYS 717
ATOM 342 CE LYS 717
ATOM 343 NZ LYS 717
ATOM 344 C LYS 717
ATOM 345 O LYS 717
ATOM 346 N TRP 718
ATOM 347 CA TRP 718
ATOM 348 CB TRP 718
ATOM 349 CG TRP 718
ATOM 350 CD2 TRP 718
ATOM 351 CE2 TRP 718
ATOM 352 CE3 TRP 718
ATOM 353 CD1 TRP 718
ATOM 354 NE1 TRP 718
ATOM 355 CZ2 TRP 718
ATOM 356 CZ3 TRP 718
ATOM 357 CH2 TRP 718
ATOM 358 C TRP 718
ATOM 359 O TRP 718
ATOM 360 N ALA 719
ATOM 361 CA ALA 719
ATOM 362 CB ALA 719
ATOM 363 C ALA 719
ATOM 364 O ALA 719
ATOM 365 N LYS 720
ATOM 366 CA LYS 720
ATOM 367 CB LYS 720
ATOM 368 CG LYS 720



33 . 


427 


12. 


270 


11. 


619 


1. 


00 


19. 


68 


34. 


453 


12. 


859 


10. 


658 


1. 


00 


20. 


01 


34. 


722 


14 . 


292 


11. 


001 


1. 


00 


21. 


16 


35. 


750 


12. 


069 


10.750 


1. 


00 


20.06 


32. 


328 


13. 


277 


11. 


990 


1. 


00 


19. 


53 


32. 


325 


13. 


802 


13. 


086 


1. 


00 


20. 


00 


31. 


330 


13. 


434 


11. 


128 


1. 


00 


19. 


20 


30. 


215 


14 . 


356 


11 . 


358 


1. 


00 


19. 


47 


29.498 


14 . 


658 


10. 


038 


1. 


00 


20.77 


30. 


331 


15. 


410 


9. 


058 


1. 


00 


21. 


60 


31. 


369 


15. 


016 


8. 


283 


1. 


00 


22. 


31 


30. 


131 


16. 


744 


8 . 


784 


1. 


00 


22. 


32 


31. 


005 


17. 


139 


7 . 


876 


1. 


00 


23. 


41 . 


31. 


768 


16. 


113 


7 . 


557 


1. 


00 


23. 


22 


29. 


183 


13. 


885 


12 . 


383 


1. 


00 


18. 


83 


28. 


497 


14 . 


701 


13. 


005 


1. 


00 


18. 


73 


29. 


006 


12. 


572 


12. 


485 


1. 


00 


18. 


39 


28. 


063 


11. 


972 


13. 


434 


1. 


00 


16.86 


27. 


869 


10. 


435 


13. 


134 


1. 


00 


16. 


78 


27. 


037 


9. 


756 


14 . 


197 


1. 


00 


17. 


10 


27. 


183 


10. 


259 


11 . 


817 


1. 


00 


17. 


34 


28. 


667 


12. 


,166 


14 . 


817 


1 . 


00 


15. 


60 


27. 


958 


12. 


,422 


15. 


788 


1. 


,00 


15. 


49 


29. 986 


12. 


,077 


14 . 


913 


1. 


.00 


15 . 


. 13 


30 . 


, d22 


12. 


.250 


16. 


205 


1. 


,00 


15. 


,01 


32 . 


1 7 C 

. 13b 


11. 


.885 


16. 


, 158 


1. 


,00 


14 . 


, 93 


32 . 


. 82b 


12, 


.233 


17 . 


. 481 


1. 


.00 


13. 


.26 


32 , 


, 310 


10, 


,373 


15, 


, 870 


1, 


.00 


14 , 


.26 


30 . 


A 1 O 


13, 


.681 


16. 


.708 


1. 


.00 


15 . 


.83 


30 , 


. 12? 


13, 


.883 


17 . 


.887 


1. 


.00 


16, 


. 61 


■jn 
OU , 


. D H ** 


14 


.665 


15 , 


.816 


1. 


.00 


16 , 


.59 


30 


.390 


16 


.082 


16 , 


. 183 


1. 


.00 


17 , 


. 20 


30 


.884 


16 


.974 


15 . 


.041 


1 


.00 


18 


. 94 


32 


.361 


16 


.747 


14 


. 698 


1 


.00 


22 


. 56 


33 


.245 


16 


.752 


15, 


. 978 


1 


.00 


25 


. 34 


34 


.294 


15 


. 609 


16 


.007 


1 


.00 


27 


. 06 


34 


.709 


15 


.195 


17 


.410 


1 


.00 


27 


.21 


28 


.951 


16 


.387 


16 


.534 


1 


.00 


16 


.77 


28 


.658 


16 


.931 


17 


. 593 


1 


.00 


18 


.49 


28 


.049 


15 


.976 


15 


. 659 


1 


.00 


15 


. 68 


26 


.618 


16 


.143 


15 


.868 


1 


.00 


14 


.61 


25 


.889 


15 


.442 


14 


. 689 


1 


.00 


11 


. 97 


24 


.433 


15 


.266 


14 


.841 


1 


.00 


9 


,66 


23 


.757 


14 


.069 


15 


.254 


1 


.00 


10 


.28 


22 


.373 


14 


.371 


15 


.293 


1 


.00 


9 


.98 


24 


.176 


12 


.773 


15 


. 612 


1 


.00 


10 


.09 


23 


.472 


16 


. 199 


14 


. 64 5 


1 


.00 


9 


.89 


22 


.228 


15 


. 688 


1 4 


. 918 


1 


.00 


8 


.38 


21 


.394 


13 


.419 


15 


. 663 


1 


.00 


9 


.00 


23 


.201 


11 


.835 


15 


. 980 


1 


.00 


8 


.20 


21 


.835 


12 


. 171 


16 


.004 


1 


.00 


7 


.32 


26 


.200 


15 


. 562 


17 


.261 


1 


.00 


15 


.34 


25 


.659 


16 


.269 


18 


.124 


1 


.00 


14 


.55 


26 


.468 


14 


.272 


17 


.464 


1 


.00 


16 


.10 


26 


.143 


13 


.559 


18 


.68 3 


1 


.00 


15 


.03 


26 


.796 


12 


. 184 


18 


.657 


1 


.00 


13 


.59 


26 


.623 


14 


. 346 


19 


.881 


1 


.00 


15 


.62 


25 


.857 


14 


.646 


20 


.785 


1 


.CO 


15 


.85 


27 


.870 


14 


.781 


19 


.828 


1 


.00 


17 


.45 


28 


.463 


15 


.516 


20 


.924 


1 


.00 


18 


.63 


29 


.970 


15 


.625 


20 


.715 


1 


.00 


19 


.81 


30 


. 644 


14 


.292 


21 


.012 


1 


.00 


21 


.18 






ATOM 369 CD LYS 720 32.136 14.334 20.860 1.00 23.81
ATOM 370 CE LYS 720 32.762 12.975 21.244 1.00 25.84
ATOM 371 NZ LYS 720 32.729 12.661 22.708 1.00 26.70
ATOM 372 C LYS 720 27.822 16.860 21.204 1.00 18.98
ATOM 373 O LYS 720 27.921 17.377 22.321 1.00 19.86
ATOM 374 N ALA 721 27.070 17.369 20.238 1.00 18.21
ATOM 375 CA ALA 721 26.406 18.651 20.382 1.00 18.10
ATOM 376 CB ALA 721 26.584 19.461 19.146 1.00 17.43
ATOM 377 C ALA 721 24.941 18.492 20.675 1.00 18.80
ATOM 378 O ALA 721 24.192 19.485 20.660 1.00 19.16
ATOM 379 N LEU 722 24.518 17.247 20.904 1.00 19.00
ATOM 380 CA LEU 722 23.119 16.912 21.207 1.00 19.60
ATOM 381 CB LEU 722 22.955 15.395 21.119 1.00 19.45
ATOM 382 CG LEU 722 21.855 14.771 20.271 1.00 19.62
ATOM 383 CD1 LEU 722 21.540 15.657 19.099 1.00 17.02
ATOM 384 CD2 LEU 722 22.298 13.382 19.815 1.00 17.38
ATOM 385 C LEU 722 22.754 17.362 22.616 1.00 20.27
ATOM 386 O LEU 722 23.521 17.125 23.549 1.00 21.72
ATOM 387 N PRO 723 21.574 17.992 22.811 1.00 20.69
ATOM 388 CD PRO 723 20.500 18.317 21.861 1.00 20.29
ATOM 389 CA PRO 723 21.211 18.428 24.167 1.00 21.24
ATOM 390 CB PRO 723 19.767 18.917 23.997 1.00 20.40
ATOM 391 CG PRO 723 19.706 19.349 22.624 1.00 20.05
ATOM 392 C PRO 723 21.266 17.287 25.195 1.00 21.66
ATOM 393 O PRO 723 20.821 16.165 24.935 1.00 21.14
ATOM 394 N GLY 724 21.800 17.588 26.369 1.00 22.02
ATOM 395 CA GLY 724 21.874 16.598 27.416 1.00 22.29
ATOM 396 C GLY 724 22.838 15.478 27.132 1.00 23.13
ATOM 397 O GLY 724 23.076 14.658 28.004 1.00 23.78
ATOM 398 N PHE 725 23.434 15.446 25.946 1.00 24.14
ATOM 399 CA PHE 725 24.360 14.368 25.610 1.00 24.24
ATOM 400 CB PHE 725 24.915 14.554 24.244 1.00 23.59
ATOM 401 CG PHE 725 25.648 13.353 23.703 1.00 23.80
ATOM 402 CD1 PHE 725 24.944 12.239 23.260 1.00 22.83
ATOM 403 CD2 PHE 725 27.046 13.328 23.675 1.00 22.40
ATOM 404 CE1 PHE 725 25.623 11.130 22.804 1.00 22.77
ATOM 405 CE2 PHE 725 27.731 12.226 23.221 1.00 21.05
ATOM 406 CZ PHE 725 27.025 11.123 22.784 1.00 22.31
ATOM 407 C PHE 725 25.505 14.170 26.592 1.00 24.85
ATOM 408 O PHE 725 25.873 13.028 26.863 1.00 23.79
ATOM 409 N ARG 726 26.083 15.270 27.070 1.00 25.97
ATOM 410 CA ARG 726 27.207 15.229 28.033 1.00 27.63
ATOM 411 CB ARG 726 27.831 16.620 28.204 1.00 29.27
ATOM 412 CG ARG 726 28.622 17.087 26.995 1.00 31.68
ATOM 413 CD ARG 726 29.759 16.141 26.727 1.00 34.22
ATOM 414 NE ARG 726 30.657 16.595 25.670 1.00 37.18
ATOM 415 CZ ARG 726 31.872 16.090 25.464 1.00 38.28
ATOM 416 NH1 ARG 726 32.635 16.558 24.486 1.00 39.44
ATOM 417 NH2 ARG 726 32.329 15.109 26.232 1.00 38.78
ATOM 418 C ARG 726 26.902 14.615 29.423 1.00 27.24
ATOM 419 O ARG 726 27.797 14.449 30.253 1.00 27.08
ATOM 420 N ASN 727 25.632 14.316 29.683 1.00 27.15
ATOM 421 CA ASN 727 25.244 13.695 30.938 1.00 26.29
ATOM 422 CB ASN 727 23.717 13.687 31.115 1.00 25.46
ATOM 423 CG ASN 727 23.118 15.085 31.195 1.00 24.95
ATOM 424 OD1 ASN 727 21.947 15.277 30.893 1.00 24.88
ATOM 425 ND2 ASN 727 23.909 16.055 31.628 1.00 24.54
ATOM 426 C ASN 727 25.750 12.261 30.934 1.00 26.17
ATOM 427 O ASN 727 25.992 11.689 31.994 1.00 27.30
ATOM 428 N LEU 728 25.895 11.673 29.749 1.00 25.64
ATOM 429 CA LEU 728 26.362 10.289 29.625 1.00 25.16
ATOM 430 CB LEU 728 26.191 9.759 28.190 1.00 22.98
ATOM 431 CG LEU 728
ATOM 432 CD1 LEU 728
ATOM 433 CD2 LEU 728
ATOM 434 C LEU 728
ATOM 435 O LEU 728
ATOM 436 N HIS 729
ATOM 437 CA HIS 729
ATOM 438 CB HIS 729
ATOM 439 CG HIS 729
ATOM 440 CD2 HIS 729
ATOM 441 ND1 HIS 729
ATOM 442 CE1 HIS 729
ATOM 443 NE2 HIS 729
ATOM 444 C HIS 729
ATOM 445 O HIS 729
ATOM 446 N VAL 730
ATOM 447 CA VAL 730
ATOM 448 CB VAL 730
ATOM 449 CG1 VAL 730
ATOM 450 CG2 VAL 730
ATOM 451 C VAL 730
ATOM 452 O VAL 730
ATOM 453 N ASP 731
ATOM 454 CA ASP 731
ATOM 455 CB ASP 731
ATOM 456 CG ASP 731
ATOM 457 OD1 ASP 731
ATOM 458 OD2 ASP 731
ATOM 459 C ASP 731
ATOM 460 O ASP 731
ATOM 461 N ASP 732
ATOM 462 CA ASP 732
ATOM 463 CB ASP 732
ATOM 464 CG ASP 732
ATOM 465 OD1 ASP 732
ATOM 466 OD2 ASP 732
ATOM 467 C ASP 732
ATOM 468 O ASP 732
ATOM 469 N GLN 733
ATOM 470 CA GLN 733
ATOM 471 CB GLN 733
ATOM 472 CG GLN 733
ATOM 473 CD GLN 733
ATOM 474 OE1 GLN 733
ATOM 475 NE2 GLN 733
ATOM 476 C GLN 733
ATOM 477 O GLN 733
ATOM 478 N MET 734
ATOM 479 CA MET 734
ATOM 480 CB MET 734
ATOM 481 CG MET 734
ATOM 482 SD MET 734
ATOM 483 CE MET 734
ATOM 484 C MET 734
ATOM 485 O MET 734
ATOM 486 N ALA 735
ATOM 487 CA ALA 735
ATOM 488 CB ALA 735
ATOM 489 C ALA 735
ATOM 490 O ALA 735
ATOM 491 N VAL 736
ATOM 492 CA VAL 736



24.859 
25.084 
23.856 
27.833 
28.571 
28.247
29.642 
29.737 
31.132 
32.276 
31.460 
32.744 
33.263 
30.450 
30.003 
31.681 
32.592 
34.036 
35.077 
34.074 
32.662 
32.704 
32.770 
32.819 
33.244 
32.966 
31.837 
33.867 
31.474 
31.408 
30.403 
29.079 
28.024 
28.073 
28.728 
27.444 
28.770 
28.030 
29.288 
29.121 
29.942 
29.359 
30.208 
30.018 
31.130
29.636 
28.979 
30.853 
31.545 
33.003 
33.749 
35.293 
34.884
30.902 
30.732 
30.571 
29.939 
29.650 
28.644 
28.393 
27.799 
26.516 



9.711 
9.133 
8.883 
10.208 
11.157 
9.064 
8.808 
7.455 
7.042 
6.978 
6.603 
6.293 
6.510 
8.772 
8.182 
9.295 
9.365 
9.793 
9.446 
11.284 
8.108 
8.192 
6.956 
5.709 
536 
152 
619 
599 
425 
4.912 
5.760 
5.510 
5.711 
4.654 
3.592 
4.904 
6.387 
5.982 
7.612 
8.591 
9.847 
10.776 
12.013 
12.696 
12.316 
7.997 
8.075 
7.459 
6.832 
6.596 
5.604 
5.121 
401 
510 
247 
671 
390 
683 
570 
905 
460 
734 



3. 
5. 
5. 
4 . 
3. 
2. 
3. 
2. 
4. 
4 . 



27.448 
26.076 
28.203 
29.974 
29.739 
30.516 
30.871 
31.570 
31.943 
31.218 
33.209 
33.247 
32.049 
29.577 
28.594 
29.625 
28.465 
28 .898 
27.811 
29.176 
27 . 600 
26.371 
28.244 
27.509 
28.410 
27.771 
27.974 
27.075 
26.889 
25.789 
27.587 
27.057 
28.119 
29.186 
28.984 
30.231 
25.875' 
24 .981 
25.920 
24.855 
25. 166 
26.225 
26. 480 
27.477 
25.577 
23.557 
22.522 
23.625 
22.508 
22.906 
22.047 
22.821 
23.387 
22 .077 
20.884 
23.052 
22.788 
24.110 
22.013 
21.015 
22.501 
21.877 



1.00 20.68 
1.00 19.66 
1.00 19.79 
1.00 25.79 
1.00 25.15 
1.00 27.05 
1.00 28.68 
1.00 30.72 
1.00 33.13 
1.00 33.50 
1.00 34.21 
1.00 34.64 
1.00 34.52 
1.00 29.25 
1.00 29.28 
1.00 30.21 
1.00 31.02 
1.00 32.08 
1.00 32.91 
1.00 31.86 
1.00 30.89 
1.00 30.40 
1.00 30.81 
1.00 31.55 
00 36.23 
00 40.32 
00 42.21 
00 42.23 
1.00 28.94 
1.00 29.47 
1.00 26.17 
1.00 24.53 
1.00 24 .04 
1.00 23.64 
1.00 22.31 
1.00 23.89 
1.00 23.96 
1.00 22.56 
1.00 23.56 
1.00 22.94 
00 22.73 
00 23.24 
00 23.27 
00 24 .33 
00 23.47 
1.00 23.20 
1.00 23.08 
1.00 23.37 
1.00 24.31 
1.00 27.26 
00 31.61 
00 39.54 
00 37.92 
00 23.31 
00 23.35 
1.64 



00 



1.00 20.4 4 
1.00 20.71 



.00 19.23 
.00 19.59 
.00 18.69 



1.00 17.76 
ATOM
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 



493 CB VAL
494 CG1 VAL
495 CG2 VAL

496 C VAL

497 O VAL
16.225 14.636

15.375 15.705 

12.907 15.926 

11.775 13.780 

11.756 15.188 

16.266 11.995 

15.457 12.558 

17.569 11.971 

18.150 12.616 

19.644 12.380 

20.370 12.908 

21.870 12.901 

22.467 11.573 

22.976 10.973 

22.928 11.561 

23.684 9.864 

17.572 12.077 

17.392 12.815 

17.391 10.761 
16.716 8.590
15.434 10.635

14.978 10.803 

14.762 10.870 

13.405 11.375 
13.239 12.818
13.823 13.732
10.678 11.894
5.364 1.00 13.27
5.538 1.00 12.27 
5.899 1.00 14.74 
5.942 1.00 14.63 
6.020 1.00 14.82 
2.857 1.00 18.41 
2.124 1.00 20.13 
2.607 1.00 19.13 
1.431 1.00 19.06 
1.389 1.00 18.53 
2.567 1.00 18.25 
2.317 1.00 17.24 
2.298 1.00 14.94 
3.370 1.00 14.90 
4.554 1.00 14.75 
3.240 1.00 13.87 
0.138 1.00 20.27 
-0.828 1.00 20.66 
0.083 1.00 22.00 
-1.093 1.00 22.25 
-0.879 1.00 20.25 
-0.687 1.00 19.78 
-1.289 1.00 23.31 
-2.409 1.00 23.88 
-0.173 1.00 24.76 
-0.156 1.00 26.45 
1.245 1.00 26.43 
1.364 1.00 28.06 
0.654 1.00 28.69 
2.118 1.00 27.96 
0.687 1.00 29.47 
2.160 1.00 28.93 
1.445 1.00 29.47 
-0.630 1.00 27.47 
-1.614 1.00 26.87 
0.125 1.00 29.01 
-0.190 1.00 30.83 
0.918 1.00 29.71 
1.183 1.00 28.99 
2.164 1.00 29.63 
-1.552 1.00 32.57 
-2.234 1.00 33.24 
-1.991 1.00 34.71 
-3.273 1.00 36.48 
-3.149 1.00 36.89 
-2.344 1.00 37.02 
-1.456 1.00 39.11 
-2.668 1.00 37.24 
-4.457 1.00 37.88 
-5.313 1.00 39.57 
-4.518 1.00 38.99 
-5.633 1.00 39.74 
-5.971 1.00 39.96 
-6.170 1.00 40.00 
-4.871 1.00 39.28 
-5.456 1.00 39.95 
-6.134 1.00 40.19 
-4.559 1.00 40.46 
-4.265 1.00 41.28 
-5.288 1.00 43.82 
-5.005 1.00 44.84 
-4.764 1.00 46.40 
3.188 1.00 32.05 

3.235 1.00 32.56 

2.001 1.00 32.55 

2.231 1.00 26.08 

2.394 1.00 25.72 

3.048 1.00 25.92 

4.218 1.00 25.41 

4.887 1.00 23.94 

6.383 1.00 23.32 

6.570 1.00 22.41 

7.014 1.00 23.17 

3.720 1.00 25.47 
4.216 1.00 25.06 
2.757 1.00 25.60 
2.177 1.00 26.51 
0.973 1.00 28.35 
1.179 1.00 32.32 
1.716 1.00 26.02 
1.885 1.00 26.00 
1.122 1.00 26.31 
0.610 1.00 26.32 

-0.112 1.00 28.65 
-1.603 1.00 30.43 
-2.277 1.00 31.95 
-3.062 1.00 33.53 
-2.007 1.00 32.31 

1.721 1.00 24.88 
1.528 1.00 24.96 
2.907 1.00 23.72 
4.051 1.00 22.89 

5.273 1.00 23.65 
5.055 1.00 25.18 
5.651 1.00 27.08 
5.351 1.00 25.97 
6.433 1.00 28.69 
4.400 1.00 21.58 
4.764 1.00 21.52 

4.274 1.00 21.26 
989 CA PHE 794 8.999 

990 CB PHE 794 10.334 

991 CG PHE 794 11.413 

992 CD1 PHE 794 11.226 

993 CD2 PHE 794 12.599 

994 CE1 PHE 794 12.206 

995 CE2 PHE 794 13.570 

996 CZ PHE 794 13.374 

997 C PHE 794 7.929 

998 O PHE 794 7.387 

999 N GLY 795 7.688 
1000 CA GLY 795 6.676 

C GLY 795 5.309 

O GLY 795 4.414 

1003 N TRP 796 5.181 

1004 CA TRP 796 3.931 

1005 CB TRP 796 4.135 

1006 CG TRP 796 4.478 

1007 CD2 TRP 796 5.208 

1008 CE2 TRP 796 5.312 

1009 CE3 TRP 796 5.777 

1010 CD1 TRP 796 4.177 

1011 NE1 TRP 796 4.676 

1012 CZ2 TRP 796 5.967 

1013 CZ3 TRP 796 6.427 

1014 CH2 TRP 796 6.514 
C TRP 796 3.345 
O TRP 796 2.132 
N LEU 797 4.223 

1018 CA LEU 797 

1019 CB LEU 797 

1020 CG LEU 797 

1021 CD1 LEU 797 

1022 CD2 LEU 797 
C LEU 797 
O LEU 797 

1025 N GLN 798 

1026 CA GLN 798 

1027 CB GLN 798 

1028 CG GLN 798 

1029 CD GLN 798 

1030 OE1 GLN 798 

1031 NE2 GLN 798 

1032 C GLN 798 

1033 O GLN 798 

1034 N ILE 799 

1035 CA ILE 799 

1036 CB ILE 799 

1037 CG2 ILE 799 

1038 CG1 ILE 799 

1039 CD1 ILE 799 

1040 C ILE 799 

1041 O ILE 799 

1042 N THR 800 

1043 CA THR 800 

1044 CB THR 800 

1045 OG1 THR 800 

1046 CG2 THR 800 

1047 C THR 800 9.475 

1048 O THR 800 10.069 

1049 N PRO 801 10.116 

1050 CD PRO 801 9.618 



1015 
1016 
1023 
1024 



5. 
4 . 
6. 
7. 



9.208 
9.890 
9.541 
9.728 
8.974 
9.347 
8.593 
8.777 
9.863 

10.906 
9.270 
9.750 
9.232 
7.912 
3.763 
3.556 
709 
4.415 
716 
11.747 
14.552 
14.790 
4.323 
3.759 
1.662 
2.037 
2.081 
2.428 
1.187 
0.985 



1.00 20.65 
1.00 19.81 
1.00 19.96 
1.00 20.01 
1.00 19.43 
1.00 19.86 
1.00 18.95 
1.00 19.37 
1.00 22.26 
1.00 22.19 
4 , 
-0.417 
801 11.555 17.600 

801 11.797 18.983 

801 10.908 18.956 

801 11.907 17.570 

801 12.981 17.101 

802 10.982 18.045 
802 11.189 18.079 
802 10.316 19.162 
802 10.582 20.596 
802 9.997 20.900 
802 8.948 20.381 
802 10.660 21.782 
802 10.968 16.715 

802 11.599 16.415 

803 10.064 15.904 
803 9.797 14.558 
803 8.632 13.897 
803 7.277 14.434 
803 6.147 13.786 
803 6.308 13.392 
803 5.065 13.680 
803 11.067 13.784 

803 11.537 13.042 

804 11.612 14.001 
804 12.863 13.418 
804 13.144 13.867 
804 14.557 13.645 
804 15.012 12.380 
804 15.440 14.706 
804 16.335 12.160 
804 16.765 14.496 
804 17.214 13.217 
804 14.034 13.802 

804 14.807 12.939 

805 14.187 15.086 
805 15.271 15.503 
805 15.250 17.003 
805 15.552 17.834 
805 15.704 19.281 
805 16.816 17.343 
805 15.172 14.767 

805 16.142 14.205 

806 13.980 14.719 
806 13.765 14.026 
806 12.372 14.332 
806 12.142 16.017 
806 13.938 12.515 

806 14.575 11.904 

807 13.348 11.903 
807 13.491 10.458 
807 12.668 9.944 
807 11.195 9.877 
807 10.377 9.142 
807 10.144 10.560 
807 14.947 10.062 

807 15.371 9.039 

808 15.712 10.871 
808 17.116 10.592 
808 17.729 11.514 
808 19.171 11.154 
808 19.679 11.569 
808 19.422 13.053 
32.226 1.00 27.58 

34.122 1.00 27.13 

34.595 1.00 26.37 

36.113 1.00 28.08 

36.792 1.00 31.02 

38.335 1.00 33.41 

39.002 1.00 34.57 

38.739 1.00 35.55 

33.876 1.00 24.74 

33.799 1.00 25.30 

33.330 1.00 24.06 
32.597 1.00 23.44 
32.573 1.00 25.61 
33.930 1.00 28.86 
34.890 1.00 29.49 
34.268 1.00 29.08 
36.156 1.00 29.57 
35.533 1.00 29.80 
36.479 1.00 30.14 
31.178 1.00 21.78 
30.568 1.00 21.42 
30.636 1.00 20.42 
29.299 1.00 18.74 
28.857 1.00 17.62 
27.610 1.00 16.77 
26.380 1.00 16.74 
27.656 1.00 16.46 
25.212 1.00 16.50 
26.491 1.00 17.41 
25.266 1.00 16.35 
29.277 1.00 19.19 

28.331 1.00 18.79 

30.312 1.00 20.42 
31.633 1.00 23.37 
32.334 1.00 35.29 
31.360 1.00 34.89 
30.174 1.00 22.78 
27.215 1.00 21.40 
26.481 1.00 23.35 
27.079 1.00 22.23 
26.097 1.00 22.37 
26.186 1.00 22.50 
25.219 1.00 22.59 
25.375 1.00 23.29 
We claim: 

1. A crystal of an AR-LBD comprising: 

a) an AR-LBD and an AR-LBD ligand or 

b) an AR-LBD without an AR-LBD ligand; 

wherein said crystal diffracts to at least 3 angstrom resolution and has a 
crystal stability within 5% of its unit cell dimensions. 

2. The crystal of claim 1 wherein said AR-LBD has at least 200 amino 
acids. 

3. The crystal of claim 1, wherein said AR-LBD is the AR amino acid 
sequence 672 to 917 of rat AR. 

4. The crystal of claim 1, wherein said AR-LBD is the AR amino acid 
sequence 672 to 917 of human AR. 

5. The crystal of claim 1 wherein the crystal comprises an AR-LBD and 
an AR-LBD ligand and the AR-LBD ligand is an agonist or 
antagonist, a partial agonist or partial antagonist, or a SARM of the 
AR-LBD. 

6. The crystal of claim 5 wherein the agonist is dihydrotestosterone. 

7. The crystal of claim 1 having all of the coordinates listed in Table A. 
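
As an illustration of how coordinates such as those in Table A might be read by software, the following is a minimal sketch in Python. It assumes each record is whitespace-delimited in the order ATOM, serial number, atom name, residue name, residue number, x, y, z, occupancy, B-factor; that column order is inferred from the table fragments and is an assumption, and the `Atom` type, the `parse_atom_line` helper, and the sample record are hypothetical.

```python
from typing import NamedTuple


class Atom(NamedTuple):
    """One coordinate record (field order is an assumption, see lead-in)."""
    serial: int
    name: str
    res_name: str
    res_seq: int
    x: float
    y: float
    z: float
    occupancy: float
    b_factor: float


def parse_atom_line(line: str) -> Atom:
    """Parse one whitespace-delimited ATOM record into an Atom."""
    fields = line.split()
    if len(fields) != 10 or fields[0] != "ATOM":
        raise ValueError(f"not a complete ATOM record: {line!r}")
    serial, name, res_name, res_seq = fields[1:5]
    x, y, z, occupancy, b_factor = (float(v) for v in fields[5:10])
    return Atom(int(serial), name, res_name, int(res_seq),
                x, y, z, occupancy, b_factor)


# Hypothetical record for illustration only; it is not taken from Table A.
example = parse_atom_line("ATOM 1 N GLY 1 0.000 1.000 2.000 1.00 20.00")
```

Records that are truncated or split across lines would be rejected by the length check rather than silently misread.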

8. The crystal of claim 1 wherein said crystal comprises mammalian 
AR-LBD protein. 

9. The crystal of claim 1 wherein said crystal comprises rat AR-LBD 
protein. 

10. The crystal of claim 1 wherein said AR-LBD ligand has the 
following unit cell dimensions in angstroms: a = 56.03 ± 5%, 
b = 66.27 ± 5%, c = 70.38 ± 5% and an orthorhombic space group 
P212121. 
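
The ± 5% tolerance on the unit cell can be checked mechanically. The sketch below, in Python, tests whether a measured orthorhombic cell falls within 5% of each reference edge; the second edge is taken to be b, and the `cell_within_tolerance` helper and its dictionary layout are hypothetical names chosen for illustration.

```python
# Reference cell edges in angstroms, as recited in claim 10.
REFERENCE_CELL = {"a": 56.03, "b": 66.27, "c": 70.38}
TOLERANCE = 0.05  # the claimed +/- 5%


def cell_within_tolerance(cell, reference=REFERENCE_CELL, tolerance=TOLERANCE):
    """Return True if every cell edge is within +/- tolerance of the reference."""
    return all(
        abs(cell[axis] - ref) <= tolerance * ref
        for axis, ref in reference.items()
    )
```

For example, a cell with a = 58.0 differs from 56.03 by 1.97 angstroms, which is inside the 2.80 angstrom window allowed for that edge.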

11. A molecule or molecular complex comprising all or any part of 
the ligand binding site defined by structure coordinates of AR-LBD 
amino acids V685, L700, L701, S702, S703, L704, N705, E706, 
L707, G708, E709, Q711, A735, I737, Q738, Y739, S740, W741, 
M742, G743, L744, M745, V746, F747, A748, M749, G750, R752, 
Y763, F764, A765, L768, F770, M780, M787, I869, L873, H874, 
F876, T877 and F878 according to Table A, or a mutant or 
homologue of said molecule or molecular complex. 

12. The molecule or molecular complex of claim 11 wherein said 
mutant or homologue comprises a binding pocket that has a root 
mean square deviation from the backbone atoms of said AR-LBD 
amino acids of not more than 1.5 Angstroms or 30% sequence 
identity with said AR-LBD amino acids. 

13. A molecule or molecular complex comprising all or any part of 
the ligand binding site defined by structure coordinates of AR-LBD 
amino acids N705, Q711, R752, F764 and T877 according to Table A, 
or a mutant or homologue of said molecule or molecular complex. 

14. The molecule or molecular complex of claim 13 wherein said 
mutant or homologue comprises a binding pocket that has a root 
mean square deviation from the backbone atoms of said AR-LBD 
amino acids of not more than 1.5 Angstroms or 30% sequence 
identity with said AR-LBD amino acids. 
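
The root mean square deviation recited above can be computed as in the following Python sketch. It assumes the two sets of backbone atoms are already paired one-to-one and already superimposed in a common frame; the superposition step itself is not shown, and the `backbone_rmsd` helper is a hypothetical name.

```python
import math


def backbone_rmsd(coords_a, coords_b):
    """RMSD in angstroms between two equally long lists of (x, y, z) tuples.

    Assumes atoms are paired by index and already superimposed.
    """
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate lists must be non-empty and equal in length")
    total = sum(
        (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
        for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b)
    )
    return math.sqrt(total / len(coords_a))
```

A candidate homologue would then be compared against the claimed bound with a test such as `backbone_rmsd(reference, candidate) <= 1.5`.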

15. A machine-readable data storage medium comprising a data 
storage material encoded with machine readable data, wherein the 
data is defined by the structure coordinates of an AR-LBD/AR-LBD 
ligand or ligand complex according to Table A or a homologue of said 
complex, wherein said homologue comprises backbone atoms that 
have a root mean square deviation from the backbone atoms of the 
complex of not more than 3.0 Å. 

16. The machine-readable data storage medium according to claim 
15, wherein said AR-LBD/AR-LBD ligand or ligand complex is a 
homologue having a root mean square deviation from the backbone 
atoms of said amino acids of not more than 2.0 Å. 

17. A machine-readable data storage medium comprising a data 
storage material encoded with a first set of machine readable data 
comprising a Fourier transform of at least a portion of the structural 
coordinates for an AR-LBD/AR-LBD ligand according to Table A; 
which, when combined with a second set of machine readable data 
comprising an X-ray diffraction pattern of a molecule or molecular 
complex of unknown structure, using a machine programmed with 
instructions for using said first set of data and said second set of 
data, can determine at least a portion of the structure coordinates 
corresponding to the second set of machine readable data, said first 
set of data and said second set of data. 

18. A binding site in AR-LBD for an AR modulator in which a 
portion of said ligand is in van der Waals contact or hydrogen 
bonding contact with any portion or all of residues V685, L700, 
L701, S702, S703, L704, N705, E706, L707, G708, E709, Q711, 
A735, I737, Q738, Y739, S740, W741, M742, G743, L744, M745, 
V746, F747, A748, M749, G750, R752, Y763, F764, A765, L768, 
F770, M780, M787, I869, L873, H874, F876, T877, F878, L880, 
L881, V889, F891, P892, E893, M894, M895, A896, E897, I898, 
I899, S900, V901, Q902, V903, P904 or I906 of AR-LBD according to 
Table A. 
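
One simple way to list residues in contact with a ligand is a distance cutoff, sketched below in Python. The 4.0 angstrom cutoff is an assumption standing in for van der Waals or hydrogen bonding contact and is not a value taken from this document; the `residues_in_contact` helper and the sample coordinates are hypothetical.

```python
import math


def residues_in_contact(ligand_atoms, protein_atoms, cutoff=4.0):
    """Return sorted residue labels having any atom within cutoff of the ligand.

    ligand_atoms: list of (x, y, z) tuples.
    protein_atoms: list of (residue_label, (x, y, z)) pairs.
    cutoff: distance in angstroms (assumed value, see lead-in).
    """
    contacts = set()
    for label, position in protein_atoms:
        if any(math.dist(position, atom) <= cutoff for atom in ligand_atoms):
            contacts.add(label)
    return sorted(contacts)


# Made-up coordinates for illustration; they are not taken from Table A.
ligand = [(0.0, 0.0, 0.0)]
protein = [
    ("N705", (3.0, 0.0, 0.0)),
    ("T877", (0.0, 3.5, 0.0)),
    ("L880", (10.0, 0.0, 0.0)),
]
nearby = residues_in_contact(ligand, protein)
```

With these made-up points, only the first two residues lie within the cutoff of the single ligand atom.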

19. The binding site according to claim 18 wherein the AR-LBD is a 
homologue or mutant with 25%-95% identity to residues V685, L700, 
L701, S702, S703, L704, N705, E706, L707, G708, E709, Q711, 
A735, I737, Q738, Y739, S740, W741, M742, G743, L744, M745, 
V746, F747, A748, M749, G750, R752, Y763, F764, A765, L768, 
F770, M780, M787, I869, L873, H874, F876, T877, F878, L880, 
L881, V889, F891, P892, E893, M894, M895, A896, E897, I898, 
I899, S900, V901, Q902, V903, P904 or I906 of AR-LBD according to 
Table A. 
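
Percent identity over a set of aligned residue positions can be computed as in this Python sketch. It assumes the two residue strings are already aligned position by position, and it divides by the full aligned length; both the convention and the `percent_identity` helper are assumptions made for illustration, and other identity definitions exist.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent of aligned positions holding the same residue in both strings.

    Gap characters ('-') never count as matches. The denominator is the
    full aligned length (an assumed convention).
    """
    if len(aligned_a) != len(aligned_b) or not aligned_a:
        raise ValueError("sequences must be non-empty and equal in length")
    matches = sum(
        1 for a, b in zip(aligned_a, aligned_b) if a == b and a != "-"
    )
    return 100.0 * matches / len(aligned_a)
```

A homologue could then be tested against the recited range with a check such as `25.0 <= percent_identity(reference, candidate) <= 95.0`.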

20. A method of obtaining structural information about a molecule 
or a molecular complex of unknown structure by using the structure 
coordinates set forth in Table A, comprising the steps of: 

a. generating X-ray diffraction data from said crystallized molecule 
or molecular complex; 

b. applying at least a portion of the structure coordinates set forth 
in Table A to said X-ray diffraction pattern to generate a three-dimensional 
electron density map of at least a portion of the 
molecule or molecular complex; and 

c. using all or a portion of the structure coordinates set forth in 
Table A to generate homology models of AR-LBD or any other 
nuclear hormone receptor ligand binding domain. 

21. A computational method of designing an androgen receptor 
synthetic ligand comprising: 

a. using a three dimensional model of a crystallized protein 
comprising an AR-LBD/AR-LBD ligand complex to determine 
at least one interacting amino acid of the AR-LBD that 
interacts with at least one first chemical moiety of the AR-LBD 
ligand; and 

b. selecting at least one chemical modification of said first 
chemical moiety to produce a second chemical moiety with a 
structure that either decreases or increases an interaction 
between said interacting amino acid and said second 
chemical moiety compared to said interaction between said 
interacting amino acid and said first chemical moiety. 

22. A method for identifying a compound that modulates androgen 
receptor activity, the method comprising any combination of steps of: 

a. modeling test compounds that fit spatially into the AR-LBD as 
defined by structure coordinates according to Table A, or using 
a three-dimensional structural model of AR-LBD, mutant AR-LBD 
or AR-LBD homologue or portion thereof; 

b. using said structure coordinates or ligand binding site as set 
forth in claim 18 to identify structural and chemical features; 

c. employing identified structural or chemical features to design or 
select compounds as potential AR modulators; 

d. employing the three-dimensional structural model or the ligand 
binding site to design or select compounds as potential AR 
modulators; 

e. synthesizing the potential AR modulators; 

f. screening the potential AR modulators in an assay 
characterized by binding of a test compound to the AR-LBD; 
and 

g. modifying or replacing one or more amino acids from AR-LBD 
selected from the group consisting of V685, L700, L701, S702, 
S703, L704, N705, E706, L707, G708, E709, Q711, A735, I737, 
Q738, Y739, S740, W741, M742, G743, L744, M745, V746, 
F747, A748, M749, G750, R752, Y763, F764, A765, L768, 
F770, M780, M787, I869, L873, H874, F876, T877, F878, 
L880, L881, V889, F891, P892, E893, M894, M895, A896, 
E897, I898, I899, S900, V901, Q902, V903, P904 or I906 of 
AR-LBD according to Table A. 

23. The method according to claim 22 wherein the potential AR 
modulator is from a library of compounds. 

24. The method according to claim 22 wherein the potential AR 
modulator is selected from a database. 

25. The method according to claim 22 wherein the potential AR 
modulator is designed de novo. 

26. The method according to claim 22 wherein the potential AR 
modulator is designed from a known agonist, partial agonist, 
antagonist, partial antagonist or SARM. 

27. The method according to claim 22 wherein the potential AR 
modulator is an agonist or partial agonist and AR activity is 
measured by translocation or unwinding of helix 12. 

28. The method according to claim 22 wherein the potential AR 
modulator is an antagonist or partial antagonist and AR activity is 
measured by translocation or unwinding of helix 12. 

29. An AR modulator identified by the method of claim 22. 

30. A method for treating prostate cancer comprising administering 
an effective amount of an AR modulator identified by the method of 
claim 22. 

31. A method for treating an age related disease comprising 
administering an effective amount of an AR modulator identified by 
the method of claim 22. 

32. The method of claim 31 wherein said age related disease is 
osteoporosis, muscle wasting or loss of libido. 